Gene Gene information from NCBI Gene database.
Entrez ID 64324
Gene name Nuclear receptor binding SET domain protein 1
Gene symbol NSD1
Synonyms (NCBI Gene)
ARA267, KMT3B, SOTOS, SOTOS1, STO
Chromosome 5
Chromosome location 5q35.3
Summary This gene encodes a protein containing a SET domain, 2 LXXLL motifs, 3 nuclear translocation signals (NLSs), 4 plant homeodomain (PHD) finger regions, and a proline-rich region. The encoded protein enhances androgen receptor (AR) transactivation, and this
SNPs SNP information provided by dbSNP.
365
SNP ID Visualize variation Clinical significance Consequence
rs61756006 T>C Likely-benign, benign, conflicting-interpretations-of-pathogenicity Coding sequence variant, synonymous variant, 5 prime UTR variant, genic upstream transcript variant
rs113856002 A>G Likely-benign, conflicting-interpretations-of-pathogenicity 5 prime UTR variant, genic upstream transcript variant, coding sequence variant, missense variant
rs121908067 C>G Pathogenic Stop gained, coding sequence variant, genic upstream transcript variant, 5 prime UTR variant
rs121908068 C>G,T Likely-benign, pathogenic Synonymous variant, missense variant, coding sequence variant, genic downstream transcript variant
rs121908069 G>C Pathogenic Missense variant, coding sequence variant, genic downstream transcript variant
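The "Visualize variation" column uses a compact `REF>ALT` notation in which several alternate alleles are separated by commas (for example `C>G,T` for rs121908068). A minimal sketch of how such a string could be split, assuming the notation always contains exactly one `>`; the function name `parse_alleles` is hypothetical and not part of dbSNP or any library:

```python
def parse_alleles(notation):
    """Split a 'REF>ALT1,ALT2' string into (ref, [alt, ...]).

    Assumes exactly one '>' separator, as in the rows listed above.
    """
    ref, alts = notation.split(">")
    return ref.strip(), [a.strip() for a in alts.split(",")]


# Rows taken from the SNP table above.
for rsid, notation in [("rs61756006", "T>C"), ("rs121908068", "C>G,T")]:
    ref, alts = parse_alleles(notation)
    print(rsid, ref, alts)
```

For rs121908068 this yields one reference allele and two alternates, which is consistent with that row listing two clinical significance values.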
miRNA miRNA information provided by the miRTarBase database.
1049
miRTarBase ID miRNA Experiments Reference
MIRT046367 hsa-miR-23b-3p CLASH 23622248
MIRT045467 hsa-miR-149-5p CLASH 23622248
MIRT045301 hsa-miR-186-5p CLASH 23622248
MIRT040128 hsa-miR-615-3p CLASH 23622248
Gene ontology (GO) Gene Ontology (GO) annotations describing the biological processes, molecular functions, and cellular components associated with a gene.
41
GO ID Ontology Definition Evidence Reference
GO:0000122 Process Negative regulation of transcription by RNA polymerase II ISS
GO:0000785 Component Chromatin IBA
GO:0000978 Function RNA polymerase II cis-regulatory region sequence-specific DNA binding IDA 20837538
GO:0003682 Function Chromatin binding ISS
GO:0003712 Function Transcription coregulator activity IDA 11509567
Other IDs Other IDs provides unique identifiers for this gene in OMIM, HGNC, and Ensembl databases.
MIM HGNC e!Ensembl
606681 14234 ENSG00000165671
Protein Protein information from UniProt database.
UniProt ID Unique identifier for the protein in the UniProt database.
Q96L73
Protein name Histone-lysine N-methyltransferase, H3 lysine-36 specific (EC 2.1.1.357) (Androgen receptor coactivator 267 kDa protein) (Androgen receptor-associated protein of 267 kDa) (H3-K36-HMTase) (Lysine N-methyltransferase 3B) (Nuclear receptor-binding SET domain
Protein function Histone methyltransferase that dimethylates Lys-36 of histone H3 (H3K36me2). Transcriptional intermediary factor capable of either negatively or positively influencing transcription, depending on the cellular context. {ECO:0000269|PubMed:21196496}
PDB 3OOI, 6KQP, 6KQQ, 8C5L
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF00855 PWWP 321 431 PWWP domain Domain
PF00855 PWWP 1754 1846 PWWP domain Domain
PF17907 AWS 1901 1939 AWS domain Domain
PF00856 SET 1953 2059 SET domain Family
PF17982 C5HCH 2161 2210 NSD Cys-His rich domain Domain
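The two numbers in each Pfam row are the start and end positions of the domain in the protein sequence. Assuming these are 1-based and inclusive (the usual convention for protein annotations, though not stated in this record), the domain length is `end - start + 1`. A short sketch; the list `pfam_domains` and the function `domain_length` are illustrative names only:

```python
# (accession, name, start, end) copied from the Pfam table above.
pfam_domains = [
    ("PF00855", "PWWP", 321, 431),
    ("PF00855", "PWWP", 1754, 1846),
    ("PF17907", "AWS", 1901, 1939),
    ("PF00856", "SET", 1953, 2059),
    ("PF17982", "C5HCH", 2161, 2210),
]


def domain_length(start, end):
    """Length of a domain given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


for accession, name, start, end in pfam_domains:
    print(accession, name, domain_length(start, end))
```

Under that assumption the five domains span 111, 93, 39, 107, and 50 residues respectively.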
Tissue specificity Expressed in the fetal/adult brain, kidney, skeletal muscle, spleen, and the thymus, and faintly in the lung.
Sequence
MDQTCELPRRNCLLPFSNPVNLDAPEDKDSPFGNGQSNFSEPLNGCTMQLSTVSGTSQNA
YGQDSPSCYIPLRRLQDLASMINVEYLNGSADGSESFQDPEKSDSRAQTPIVCTSLSPGG
PTALAMKQEPSCNNSPELQVKVTKTIKNGFLHFENFTCVDDADVDSEMDPEQPVTEDESI
EEIFEETQTNATCNYETKSENGVKVAMGSEQDSTPESRHGAVKSPFLPLAPQTETQKNKQ
RNEVDGSNEKAALLPAPFSLGDTNITIEEQLNSINLSFQDDPDSSTSTLGNMLELPGTSS
SSTSQELPFCQPKKKSTPLKYEVGDLIWAKFKRRPWWPCRICSDPLINTHSKMKVSNRRP
YRQYYVEAFGDPSERAWVAGKAIVMFEGRHQFEELPVLRRRGKQKEKGYRHKVPQKILSK
WEASVGLAEQYDVPKGSKNRKCIPGSIKLDSEEDMPFEDCTNDPESEHDLLLNGCLKSLA
FDSEHSADEKEKPCAKSRARKSSDNPKRTSVKKGHIQFEAHKDERRGKIPENLGLNFISG
DISDTQASNELSRIANSLTGSNTAPGSFLFSSCGKNTAKKEFETSNGDSLLGLPEGALIS
KCSREKNKPQRSLVCGSKVKLCYIGAGDEEKRSDSISICTTSDDGSSDLDPIEHSSESDN
SVLEIPDAFDRTENMLSMQKNEKIKYSRFAATNTRVKAKQKPLISNSHTDHLMGCTKSAE
PGTETSQVNLSDLKASTLVHKPQSDFTNDALSPKFNLSSSISSENSLIKGGAANQALLHS
KSKQPKFRSIKCKHKENPVMAEPPVINEECSLKCCSSDTKGSPLASISKSGKVDGLKLLN
NMHEKTRDSSDIETAVVKHVLSELKELSYRSLGEDVSDSGTSKPSKPLLFSSASSQNHIP
IEPDYKFSTLLMMLKDMHDSKTKEQRLMTAQNLVSYRSPGRGDCSTNSPVGVSKVLVSGG
STHNSEKKGDGTQNSANPSPSGGDSALSGELSASLPGLLSDKRDLPASGKSRSDCVTRRN
CGRSKPSSKLRDAFSAQMVKNTVNRKALKTERKRKLNQLPSVTLDAVLQGDRERGGSLRG
GAEDPSKEDPLQIMGHLTSEDGDHFSDVHFDSKVKQSDPGKISEKGLSFENGKGPELDSV
MNSENDELNGVNQVVPKKRWQRLNQRRTKPRKRMNRFKEKENSECAFRVLLPSDPVQEGR
DEFPEHRTPSASILEEPLTEQNHADCLDSAGPRLNVCDKSSASIGDMEKEPGIPSLTPQA
ELPEPAVRSEKKRLRKPSKWLLEYTEEYDQIFAPKKKQKKVQEQVHKVSSRCEEESLLAR
GRSSAQNKQVDENSLISTKEEPPVLEREAPFLEGPLAQSELGGGHAELPQLTLSVPVAPE
VSPRPALESEELLVKTPGNYESKRQRKPTKKLLESNDLDPGFMPKKGDLGLSKKCYEAGH
LENGITESCATSYSKDFGGGTTKIFDKPRKRKRQRHAAAKMQCKKVKNDDSSKEIPGSEG
ELMPHRTATSPKETVEEGVEHDPGMPASKKMQGERGGGAALKENVCQNCEKLGELLLCEA
QCCGAFHLECLGLTEMPRGKFICNECRTGIHTCFVCKQSGEDVKRCLLPLCGKFYHEECV
QKYPPTVMQNKGFRCSLHICITCHAANPANVSASKGRLMRCVRCPVAYHANDFCLAAGSK
ILASNSIICPNHFTPRRGCRNHEHVNVSWCFVCSEGGSLLCCDSCPAAFHRECLNIDIPE
GNWYCNDCKAGKKPHYREIVWVKVGRYRWWPAEICHPRAVPSNIDKMRHDVGEFPVLFFG
SNDYLWTHQARVFPYMEGDVSSKDKMGKGVDGTYKKALQEAAARFEELKAQKELRQLQED
RKNDKKPPPYKHIKVNRPIGRVQIFTADLSEIPRCNCKATDENPCGIDSECINRMLLYEC
HPTVCPAGGRCQNQCFSKRQYPEVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEE
CRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKWSVNGDTRV
GLFALSDIKAGTELTFNYNLECLGNGKTVCKCGAPNCSGFLGVRPKNQPIATEEKSKKFK
KKQQGKRRTQGEITKEREDECFSCGDAGQLVSCKKPGCPKVYHADCLNLTKRPAGKWECP
WHQCDICGKEAASFCEMCPSSFCKQHREGMLFISKLDGRLSCTEHDPCGPNPLEPGEIRE
YVPPPVPLPPGPSTHLAEQSTGMAAQAPKMSDKPPADTNQMLSLSKKALAGTCQRPLLPE
RPLERTDSRPQPLDKVRDLAGSGTKSQSLVSSQRPLDRPPAVAGPRPQLSDKPSPVTSPS
SSPSVRSQPLERPLGTADPRLDKSIGAASPRPQSLEKTSVPTGLRLPPPDRLLITSSPKP
QTSDRPTDKPHASLSQRLPPPEKVLSAVVQTLVAKEKALRPVDQNTQSKNRAALVMDLID
LTPRQKERAASPHQVTPQADEKMPVLESSSWPASKGLGHMPRAVEKGCVSDPLQTSGKAA
APSEDPWQAVKSLTQARLLSQPPAKAFLYEPTTQASGRASAGAEQTPGPLSQSPGLVKQA
KQMVGGQQLPALAAKSGQSFRSLGKAPASLPTEEKKLVTTEQSPWALGKASSRAGLWPIV
AGQTLAQSCWSAGSTQTLAQTCWSLGRGQDPKPEQNTLPALNQAPSSHKCAESEQK
Sequence length 2696
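The sequence above is wrapped across many lines, so a length check or a domain lookup first requires joining the lines. A minimal sketch, using only the first two lines of the sequence as input and assuming 1-based inclusive coordinates; `join_wrapped_sequence` and `slice_1based` are hypothetical helper names:

```python
def join_wrapped_sequence(lines):
    """Concatenate wrapped sequence lines, skipping blank lines."""
    return "".join(line.strip() for line in lines if line.strip())


def slice_1based(seq, start, end):
    """Return residues start..end using 1-based inclusive coordinates."""
    return seq[start - 1:end]


# First two lines of the sequence listed above (illustration only).
wrapped = [
    "MDQTCELPRRNCLLPFSNPVNLDAPEDKDSPFGNGQSNFSEPLNGCTMQLSTVSGTSQNA",
    "YGQDSPSCYIPLRRLQDLASMINVEYLNGSADGSESFQDPEKSDSRAQTPIVCTSLSPGG",
]

seq = join_wrapped_sequence(wrapped)
print(len(seq))                  # 120
print(slice_1based(seq, 1, 10))  # MDQTCELPRR
```

Applied to all sequence lines, `len(seq)` should match the stated sequence length of 2696, and `slice_1based` can extract a Pfam domain from its listed start and end positions.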
Pathways Metabolic and signaling pathways associated with this gene.
KEGG: Lysine degradation; Metabolic pathways
Reactome: PKMTs methylate histone lysines
Associated diseases Disease associations from ClinVar categorized as Causal (Pathogenic/Likely Pathogenic) or Unknown.
985
Causal Diseases associated with Pathogenic or Likely Pathogenic variants in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession
Acute myeloid leukemia Likely pathogenic; Pathogenic rs587784115, rs587784170, rs794727176, rs1057520339, rs1562206791 RCV000760278, RCV000763137, RCV000763136, RCV001336054, RCV000760285
Beckwith-Wiedemann syndrome Pathogenic; Likely pathogenic rs587784104, rs587784174, rs374740802, rs878855075, rs878855077, rs1554189490, rs1060501492, rs1060501497, rs1060501490, rs1064794051, rs1554190214, rs1554190247, rs1554204952, rs1554204923, rs1554189131 (10 more not shown) RCV000461609, RCV000793328, RCV000195430, RCV000231883, RCV000231983, RCV000471287, RCV000465172, RCV000461382, RCV000463366, RCV001221224, RCV000549131, RCV000527387, RCV000543532, RCV000533339, RCV000558545, RCV000547744, RCV000628565, RCV000628556, RCV000697110, RCV000701163, RCV001053071, RCV000823647, RCV000799306, RCV001226345, RCV001226039
Intellectual disability Pathogenic rs587784092, rs2149887555 RCV001257657, RCV005626809
Marfanoid habitus and intellectual disability Likely pathogenic; Pathogenic rs1554204921 RCV000850418
Unknown Diseases with uncertain, conflicting, or no pathogenic evidence in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession
Abnormal cerebral morphology Uncertain significance rs2127285052 RCV002275913
Choroid plexus carcinoma other rs1554207664 RCV000505596
Hereditary cancer Conflicting classifications of pathogenicity rs754309202 RCV003491929
Hereditary spastic paraplegia 8 Uncertain significance rs1562312270 RCV000785152
Associations from Text Mining Disease associations identified through PubTator.
Disease Name Relationship Type References
Abnormalities Drug Induced Associate 16247291
Adenocarcinoma of Lung Associate 21151896
Alcohol Related Disorders Associate 16780628
Attention Deficit Disorder with Hyperactivity Associate 16780628
Autism Spectrum Disorder Associate 18001468
Autistic Disorder Associate 18001468
Beckwith Wiedemann Syndrome Associate 14997421
Bohring syndrome Associate 32938993
Breast Neoplasms Associate 29973584
Brugada Syndrome Associate 37114354