Gene Gene information from NCBI Gene database.
Entrez ID 9739
Gene name SET domain containing 1A, histone lysine methyltransferase
Gene symbol SETD1A
Synonyms (NCBI Gene)
EPEDDEPEO2KMT2FNEDSIDSet1Set1A
Chromosome 16
Chromosome location 16p11.2
Summary The protein encoded by this gene is a component of a histone methyltransferase (HMT) complex that produces mono-, di-, and trimethylated histone H3 at Lys4. Trimethylation of histone H3 at lysine 4 (H3K4me3) is a chromatin modification known to generally
SNPs SNP information provided by dbSNP.
6
SNP ID Visualize variation Clinical significance Consequence
rs770913157 C>G,T Pathogenic Coding sequence variant, stop gained, synonymous variant
rs869312829 A>G Pathogenic Splice acceptor variant
rs869312830 C>- Pathogenic Frameshift variant, coding sequence variant
rs869312831 C>T Pathogenic Stop gained, coding sequence variant
rs869312832 ->C Pathogenic Frameshift variant, coding sequence variant
miRNA miRNA information provided by mirtarbase database.
202
miRTarBase ID miRNA Experiments Reference
MIRT051709 hsa-let-7e-5p CLASH 23622248
MIRT044797 hsa-miR-320a CLASH 23622248
MIRT044067 hsa-miR-361-5p CLASH 23622248
MIRT040819 hsa-miR-18a-3p CLASH 23622248
MIRT036712 hsa-miR-760 CLASH 23622248
Gene ontology (GO) Gene Ontology (GO) annotations describing the biological processes, molecular functions, and cellular components associated with a gene.
37
GO ID Ontology Definition Evidence Reference
GO:0000785 Component Chromatin IDA 27141965
GO:0003676 Function Nucleic acid binding IEA
GO:0003723 Function RNA binding IDA 38003223
GO:0003723 Function RNA binding IEA
GO:0005515 Function Protein binding IPI 12670868, 17998332, 18765639, 20622854, 22266653, 22665483, 22723415, 22843687, 23870121, 26886794, 33961781, 37030068
Other IDs Other IDs provides unique identifiers for this gene in OMIM, HGNC, and Ensembl databases.
MIM HGNC e!Ensembl
611052 29010 ENSG00000099381
Protein Protein information from UniProt database.
UniProt ID Unique identifier for the protein in the UniProt database. Click to view detailed protein information.
O15047
Protein name Histone-lysine N-methyltransferase SETD1A (EC 2.1.1.364) (Lysine N-methyltransferase 2F) (SET domain-containing protein 1A) (hSET1A) (Set1/Ash2 histone methyltransferase complex subunit SET1)
Protein function Histone methyltransferase that catalyzes methyl group transfer from S-adenosyl-L-methionine to the epsilon-amino group of 'Lys-4' of histone H3 (H3K4) via a non-processive mechanism (PubMed:12670868, PubMed:25561738). Part of chromatin remodelin
PDB 3S8S , 3UVN , 4EWR , 8ILY
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF00076 RRM_1 99 166 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain) Domain
PF11764 N-SET 1425 1562 COMPASS (Complex proteins associated with Set1p) component N Domain
PF00856 SET 1579 1685 SET domain Family
Sequence
MDQEGGGDGQKAPSFQWRNYKLIVDPALDPALRRPSQKVYRYDGVHFSVNDSKYIPVEDL
QDPRCHVRSKNRDFSLPVPKFKLDEFYIGQIPLKEVTFARLNDNVRETFLKDMCRKYGEV
EEVEILLHPRTRKHLGLARVLFTSTRGAKETVKNLHLTSVMGNIIH
AQLDIKGQQRMKYY
ELIVNGSYTPQTVPTGGKALSEKFQGSGAATETAESRRRSSSDTAAYPAGTTAVGTPGNG
TPCSQDTSFSSSRQDTPSSFGQFTPQSSQGTPYTSRGSTPYSQDSAYSSSTTSTSFKPRR
SENSYQDAFSRRHFSASSASTTASTAIAATTAATASSSASSSSLSSSSSSSSSSSSSQFR
SSDANYPAYYESWNRYQRHTSYPPRRATREEPPGAPFAENTAERFPPSYTSYLPPEPSRP
TDQDYRPPASEAPPPEPPEPGGGGGGGGPSPEREEVRTSPRPASPARSGSPAPETTNESV
PFAQHSSLDSRIEMLLKEQRSKFSFLASDTEEEEENSSMVLGARDTGSEVPSGSGHGPCT
PPPAPANFEDVAPTGSGEPGATRESPKANGQNQASPCSSGDDMEISDDDRGGSPPPAPTP
PQQPPPPPPPPPPPPPYLASLPLGYPPHQPAYLLPPRPDGPPPPEYPPPPPPPPHIYDFV
NSLELMDRLGAQWGGMPMSFQMQTQMLTRLHQLRQGKGLIAASAGPPGGAFGEAFLPFPP
PQEAAYGLPYALYAQGQEGRGAYSREAYHLPMPMAAEPLPSSSVSGEEARLPPREEAELA
EGKTLPTAGTVGRVLAMLVQEMKSIMQRDLNRKMVENVAFGAFDQWWESKEEKAKPFQNA
AKQQAKEEDKEKTKLKEPGLLSLVDWAKSGGTTGIEAFAFGSGLRGALRLPSFKVKRKEP
SEISEASEEKRPRPSTPAEEDEDDPEQEKEAGEPGRPGTKPPKRDEERGKTQGKHRKSFA
LDSEGEEASQESSSEKDEEDDEEDEEDEDREEAVDTTKKETEVSDGEDEESDSSSKCSLY
ADSDGENDSTSDSESSSSSSSSSSSSSSSSSSSSSSSSESSSEDEEEEERPAALPSASPP
PREVPVPTPAPVEVPVPERVAGSPVTPLPEQEASPARPAGPTEESPPSAPLRPPEPPAGP
PAPAPRPDERPSSPIPLLPPPKKRRKTVSFSAIEVVPAPEPPPATPPQAKFPGPASRKAP
RGVERTIRNLPLDHASLVKSWPEEVSRGGRSRAGGRGRLTEEEEAEPGTEVDLAVLADLA
LTPARRGLPALPAVEDSEATETSDEAERPRPLLSHILLEHNYALAVKPTPPAPALRPPEP
VPAPAALFSSPADEVLEAPEVVVAEAEEPKPQQLQQQREEGEEEGEEEGEEEEEESSDSS
SSSDGEGALRRRSLRSHARRRRPPPPPPPPPPRAYEPRSEFEQMTILYDIWNSGLDSEDM
SYLRLTYERLLQQTSGADWLNDTHWVHHTITNLTTPKRKRRPQDGPREHQTGSARSEGYY
PISKKEKDKYLDVCPVSARQLEGVDTQGTNRVLSERRSEQRRLLSAIGTSAIMDSDLLKL
NQ
LKFRKKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIG
SSYLFRVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEI
TYDYK
FPLEDNKIPCLCGTESCRGSLN
Sequence length 1707
Interactions View interactions
Pathways Pathway information has different metabolic/signaling pathways associated with genes.
  KEGG  Reactome
  Lysine degradation
Metabolic pathways
  PKMTs methylate histone lysines
RUNX1 regulates genes involved in megakaryocyte differentiation and platelet function
Associated diseases Disease associations from ClinVar categorized as Causal (Pathogenic/Likely Pathogenic) or Unknown.
159
Causal Diseases associated with Pathogenic or Likely Pathogenic variants in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession
Autism spectrum disorder Likely pathogenic rs2543876108 RCV003127372
Epilepsy, early-onset, with or without developmental delay Likely pathogenic; Pathogenic rs2143509708, rs61744449, rs2543861155, rs2056158149, rs755127868, rs2056165149, rs2056100951, rs781482552, rs2056003750, rs2056118338 RCV002225059
RCV002289157
RCV004783056
RCV004783064
RCV001788230
RCV001034711
RCV001034712
RCV001034713
RCV001250554
RCV001254164
Intellectual disability Likely pathogenic rs2056144779 RCV001257739
Neurodevelopmental disorder Pathogenic rs2143491920 RCV002273265
Unknown Diseases with uncertain, conflicting, or no pathogenic evidence in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession
Developmental disorder Likely benign; Uncertain significance rs550183612, rs2143483723, rs149618545, rs2056372684 RCV001843766
RCV001843788
RCV001843823
RCV003127373
Gastric cancer Conflicting classifications of pathogenicity rs374206451 RCV005908943
Sarcoma Conflicting classifications of pathogenicity rs374206451 RCV005908942
See cases Uncertain significance; Conflicting classifications of pathogenicity rs1034310028, rs1237118676, rs1567367226, rs766598337 RCV004584466
RCV002252983
RCV003232916
RCV003886331
Associations from Text MiningDisease associations identified through Pubtator
Disease Name Relationship Type References
Apraxias Associate 29463886, 36117209, 40225914
Arthritis Psoriatic Associate 20862685
Arthritis Rheumatoid Associate 35479077
Autistic Disorder Associate 26938441
Breast Neoplasms Associate 30191958, 30990809, 31253781, 33223203, 35966598
Breast Neoplasms Stimulate 31897900
Carcinogenesis Associate 36707804
Carcinoma Hepatocellular Associate 27656834, 32918976, 37581938
Carcinoma Non Small Cell Lung Associate 37548102
Carcinoma Pancreatic Ductal Stimulate 36271761