Gene Gene information from NCBI Gene database.
Entrez ID 7468
Gene name Nuclear receptor binding SET domain protein 2
Gene symbol NSD2
Synonyms (NCBI Gene)
KMT3FKMT3GMMSETRAUSTREIIBPTRX5WHSWHSC1
Chromosome 4
Chromosome location 4p16.3
Summary This gene encodes a protein that contains four domains present in other developmental proteins: a PWWP domain, an HMG box, a SET domain, and a PHD-type zinc finger. It is expressed ubiquitously in early development. Wolf-Hirschhorn syndrome (WHS) is a mal
SNPs SNP information provided by dbSNP.
9
SNP ID Visualize variation Clinical significance Consequence
rs748707745 C>T Pathogenic Genic upstream transcript variant, coding sequence variant, stop gained
rs1553873247 GAGA>- Pathogenic Coding sequence variant, splice acceptor variant, genic upstream transcript variant, intron variant, upstream transcript variant
rs1560602800 G>A Pathogenic Stop gained, coding sequence variant, genic upstream transcript variant
rs1560635105 C>T Pathogenic Stop gained, coding sequence variant, genic upstream transcript variant
rs1560696317 ->G Pathogenic Frameshift variant, coding sequence variant, genic upstream transcript variant
miRNA miRNA information provided by mirtarbase database.
51
miRTarBase ID miRNA Experiments Reference
MIRT237889 hsa-miR-5582-5p PAR-CLIP 20371350
MIRT563361 hsa-miR-3622a-5p PAR-CLIP 20371350
MIRT237891 hsa-miR-6826-5p PAR-CLIP 20371350
MIRT563360 hsa-miR-4775 PAR-CLIP 20371350
MIRT237888 hsa-miR-4423-3p PAR-CLIP 20371350
Gene ontology (GO) Gene Ontology (GO) annotations describing the biological processes, molecular functions, and cellular components associated with a gene.
39
GO ID Ontology Definition Evidence Reference
GO:0000122 Process Negative regulation of transcription by RNA polymerase II IEA
GO:0000785 Component Chromatin IBA
GO:0003149 Process Membranous septum morphogenesis IEA
GO:0003289 Process Atrial septum primum morphogenesis IEA
GO:0003290 Process Atrial septum secundum morphogenesis IEA
Other IDs Other IDs provides unique identifiers for this gene in OMIM, HGNC, and Ensembl databases.
MIM HGNC e!Ensembl
602952 12766 ENSG00000109685
Protein Protein information from UniProt database.
UniProt ID Unique identifier for the protein in the UniProt database. Click to view detailed protein information.
O96028
Protein name Histone-lysine N-methyltransferase NSD2 (EC 2.1.1.357) (Multiple myeloma SET domain-containing protein) (MMSET) (Nuclear SET domain-containing protein 2) (Protein trithorax-5) (Wolf-Hirschhorn syndrome candidate 1 protein)
Protein function Histone methyltransferase which specifically dimethylates nucleosomal histone H3 at 'Lys-36' (H3K36me2) (PubMed:19808676, PubMed:22099308, PubMed:27571355, PubMed:29728617, PubMed:33941880). Also monomethylates nucleosomal histone H3 at 'Lys-36'
PDB 5LSU , 5VC8 , 6UE6 , 6XCG , 7CRO , 7E8D , 7LMT , 7MDN , 7VLN , 9EXW , 9EXX , 9EXY , 9GBF
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF00855 PWWP 220 334 PWWP domain Domain
PF00505 HMG_box 455 508 HMG (high mobility group) box Domain
PF00855 PWWP 878 970 PWWP domain Domain
PF17907 AWS 1022 1060 AWS domain Domain
PF00856 SET 1074 1180 SET domain Family
PF17982 C5HCH 1282 1330 NSD Cys-His rich domain Domain
Tissue specificity TISSUE SPECIFICITY: Widely expressed (PubMed:18172012, PubMed:9618163). Predominantly expressed in thymus and testis (PubMed:18172012, PubMed:9787135). {ECO:0000269|PubMed:18172012, ECO:0000269|PubMed:9618163, ECO:0000269|PubMed:9787135}.
Sequence
MEFSIKQSPLSVQSVVKCIKMKQAPEILGSANGKTPSCEVNRECSVFLSKAQLSSSLQEG
VMQKFNGHDALPFIPADKLKDLTSRVFNGEPGAHDAKLRFESQEMKGIGTPPNTTPIKNG
SPEIKLKITKTYMNGKPLFESSICGDSAADVSQSEENGQKPENKARRNRKRSIKYDSLLE
QGLVEAALVSKISSPSDKKIPAKKESCPNTGRDKDHLLKYNVGDLVWSKVSGYPWWPCMV
SADPLLHSYTKLKGQKKSARQYHVQFFGDAPERAWIFEKSLVAFEGEGQFEKLCQESAKQ
APTKAEKIKLLKPISGKLRAQWEMGIVQAEEAAS
MSVEERKAKFTFLYVGDQLHLNPQVA
KEAGIAAESLGEMAESSGVSEEAAENPKSVREECIPMKRRRRAKLCSSAETLESHPDIGK
STPQKTAEADPRRGVGSPPGRKKTTVSMPRSRKGDAASQFLVFCQKHRDEVVAEHPDASG
EEIEELLRSQWSLLSEKQRARYNTKFAL
VAPVQAEEDSGNVNGKKRNHTKRIQDPTEDAE
AEDTPRKRLRTDKHSLRKRDTITDKTARTSSYKAMEAASSLKSQAATKNLSDACKPLKKR
NRASTAASSALGFSKSSSPSASLTENEVSDSPGDEPSESPYESADETQTEVSVSSKKSER
GVTAKKEYVCQLCEKPGSLLLCEGPCCGAFHLACLGLSRRPEGRFTCSECASGIHSCFVC
KESKTDVKRCVVTQCGKFYHEACVKKYPLTVFESRGFRCPLHSCVSCHASNPSNPRPSKG
KMMRCVRCPVAYHSGDACLAAGCSVIASNSIICTAHFTARKGKRHHAHVNVSWCFVCSKG
GSLLCCESCPAAFHPDCLNIEMPDGSWFCNDCRAGKKLHFQDIIWVKLGNYRWWPAEVCH
PKNVPPNIQKMKHEIGEFPVFFFGSKDYYWTHQARVFPYMEGDRGSRYQGVRGIGRVFKN
ALQEAEARFR
EIKLQREARETQESERKPPPYKHIKVNKPYGKVQIYTADISEIPKCNCKP
TDENPCGFDSECLNRMLMFECHPQVCPAGEFCQNQCFTKRQYPETKIIKTDGKGWGLVAK
RDIRKGEFVNEYVGELIDEEECMARIKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFM
NHSCQPNCETLKWTVNGDTRVGLFAVCDIPAGTELTFNYN
LDCLGNEKTVCRCGASNCSG
FLGDRPKTSTTLSSEEKGKKTKKKTRRRRAKGEGKRQSEDECFRCGDGGQLVLCDRKFCT
KAYHLSCLGLGKRPFGKWECPWHHCDVCGKPSTSFCHLCPNSFCKEHQDGTAFSCTPDGR
SYCCEHDLGA
ASVRSTKTEKPPPEPGKPKGKRRRRRGWRRVTEGK
Sequence length 1365
Interactions View interactions
Pathways Pathway information has different metabolic/signaling pathways associated with genes.
  KEGG  Reactome
  Lysine degradation
Metabolic pathways
Transcriptional misregulation in cancer
  PKMTs methylate histone lysines
Recruitment and ATM-mediated phosphorylation of repair and signaling proteins at DNA double strand breaks
Nonhomologous End-Joining (NHEJ)
Processing of DNA double-strand break ends
G2/M DNA damage checkpoint
Associated diseases Disease associations from ClinVar categorized as Causal (Pathogenic/Likely Pathogenic) or Unknown.
136
Causal Diseases associated with Pathogenic or Likely Pathogenic variants in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession
4p partial monosomy syndrome Pathogenic; Likely pathogenic rs1725000714, rs2108971782, rs2474331632, rs1553873247, rs1560602800, rs1560635105, rs1560696317 RCV001330272
RCV001754558
RCV002288414
RCV000660609
RCV001249663
RCV001249665
RCV001249664
atypical Wolf-Hirschhorn syndrome Pathogenic rs1717589360 RCV001254026
Global developmental delay Pathogenic rs1553873247 RCV001779037
Neurodevelopmental delay Pathogenic rs1553873247 RCV002274084
Unknown Diseases with uncertain, conflicting, or no pathogenic evidence in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession
Acute myeloid leukemia -; Benign rs73202837, rs886059317 RCV005950708
RCV005897801
Cervical cancer Benign; - rs199564032, rs10019502, rs886059317 RCV005929974
RCV005950593
RCV005897802
Cholangiocarcinoma Likely benign rs768855129 RCV005934711
Clear cell carcinoma of kidney Likely benign rs200130380 RCV005929975
Associations from Text MiningDisease associations identified through Pubtator
Disease Name Relationship Type References
Bone Diseases Associate 33420361
Bone Marrow Neoplasms Associate 32826945
Breast Neoplasms Associate 36805183, 39395300, 39667501
Capillary Malformation Arteriovenous Malformation Inhibit 19121287
Carcinogenesis Associate 18172012, 22028615, 37756162
Carcinoma Basal Cell Stimulate 30215205
Carcinoma Hepatocellular Associate 28426876
Carcinoma Non Small Cell Lung Stimulate 33658396
Carcinoma Non Small Cell Lung Associate 34313353
Carcinoma Renal Cell Associate 31692936