Gene
Entrez ID Entrez Gene ID - the GENE ID in NCBI Gene database.
9739
Gene name Gene Name - the full gene name approved by the HGNC.
SET domain containing 1A, histone lysine methyltransferase
Gene symbol Gene Symbol - the official gene symbol approved by the HGNC.
SETD1A
Synonyms (NCBI Gene) Gene synonyms aliases
EPEDD, EPEO2, KMT2F, NEDSID, Set1, Set1A
Disease Acronyms (UniProt) Disease acronyms from UniProt database
EPEO2, NEDSID
Chromosome Chromosome number
16
Chromosome location Chromosomal Location - indicates the cytogenetic location of the gene or region on the chromosome.
16p11.2
Summary Summary of gene provided in NCBI Entrez Gene.
The protein encoded by this gene is a component of a histone methyltransferase (HMT) complex that produces mono-, di-, and trimethylated histone H3 at Lys4. Trimethylation of histone H3 at lysine 4 (H3K4me3) is a chromatin modification known to generally
SNPs SNP information provided by dbSNP.
SNP ID Visualize variation Clinical significance Consequence
rs770913157 C>G,T Pathogenic Coding sequence variant, stop gained, synonymous variant
rs869312829 A>G Pathogenic Splice acceptor variant
rs869312830 C>- Pathogenic Frameshift variant, coding sequence variant
rs869312831 C>T Pathogenic Stop gained, coding sequence variant
rs869312832 ->C Pathogenic Frameshift variant, coding sequence variant
miRNA miRNA information provided by mirtarbase database.
miRTarBase ID miRNA Experiments Reference
MIRT051709 hsa-let-7e-5p CLASH 23622248
MIRT044797 hsa-miR-320a CLASH 23622248
MIRT044067 hsa-miR-361-5p CLASH 23622248
MIRT040819 hsa-miR-18a-3p CLASH 23622248
MIRT036712 hsa-miR-760 CLASH 23622248
Gene ontology (GO) Gene ontology information of associated ontologies with gene provided by GO database.
GO ID Ontology Definition Evidence Reference
GO:0000785 Component Chromatin IDA 27141965
GO:0003723 Function RNA binding IEA
GO:0005515 Function Protein binding IPI 12670868, 17998332, 18765639, 20622854, 22266653, 22665483, 22723415, 22843687, 23870121, 26886794
GO:0005634 Component Nucleus IDA 17500065
GO:0005634 Component Nucleus IMP 22723415
Other IDs Other ids provides unique ids of gene in databases such as OMIM, HGNC, ENSEMBLE.
MIM HGNC e!Ensembl
611052 29010 ENSG00000099381
Protein
UniProt ID O15047
Protein name Histone-lysine N-methyltransferase SETD1A (EC 2.1.1.364) (Lysine N-methyltransferase 2F) (SET domain-containing protein 1A) (hSET1A) (Set1/Ash2 histone methyltransferase complex subunit SET1)
Protein function Histone methyltransferase that catalyzes methyl group transfer from S-adenosyl-L-methionine to the epsilon-amino group of 'Lys-4' of histone H3 (H3K4) via a non-processive mechanism (PubMed:12670868, PubMed:25561738). Part of chromatin remodelin
PDB 3S8S , 3UVN , 4EWR , 8ILY
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF00076 RRM_1 99 166 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain) Domain
PF11764 N-SET 1425 1562 COMPASS (Complex proteins associated with Set1p) component N Domain
PF00856 SET 1579 1685 SET domain Family
Sequence
MDQEGGGDGQKAPSFQWRNYKLIVDPALDPALRRPSQKVYRYDGVHFSVNDSKYIPVEDL
QDPRCHVRSKNRDFSLPVPKFKLDEFYIGQIPLKEVTFARLNDNVRETFLKDMCRKYGEV
EEVEILLHPRTRKHLGLARVLFTSTRGAKETVKNLHLTSVMGNIIH
AQLDIKGQQRMKYY
ELIVNGSYTPQTVPTGGKALSEKFQGSGAATETAESRRRSSSDTAAYPAGTTAVGTPGNG
TPCSQDTSFSSSRQDTPSSFGQFTPQSSQGTPYTSRGSTPYSQDSAYSSSTTSTSFKPRR
SENSYQDAFSRRHFSASSASTTASTAIAATTAATASSSASSSSLSSSSSSSSSSSSSQFR
SSDANYPAYYESWNRYQRHTSYPPRRATREEPPGAPFAENTAERFPPSYTSYLPPEPSRP
TDQDYRPPASEAPPPEPPEPGGGGGGGGPSPEREEVRTSPRPASPARSGSPAPETTNESV
PFAQHSSLDSRIEMLLKEQRSKFSFLASDTEEEEENSSMVLGARDTGSEVPSGSGHGPCT
PPPAPANFEDVAPTGSGEPGATRESPKANGQNQASPCSSGDDMEISDDDRGGSPPPAPTP
PQQPPPPPPPPPPPPPYLASLPLGYPPHQPAYLLPPRPDGPPPPEYPPPPPPPPHIYDFV
NSLELMDRLGAQWGGMPMSFQMQTQMLTRLHQLRQGKGLIAASAGPPGGAFGEAFLPFPP
PQEAAYGLPYALYAQGQEGRGAYSREAYHLPMPMAAEPLPSSSVSGEEARLPPREEAELA
EGKTLPTAGTVGRVLAMLVQEMKSIMQRDLNRKMVENVAFGAFDQWWESKEEKAKPFQNA
AKQQAKEEDKEKTKLKEPGLLSLVDWAKSGGTTGIEAFAFGSGLRGALRLPSFKVKRKEP
SEISEASEEKRPRPSTPAEEDEDDPEQEKEAGEPGRPGTKPPKRDEERGKTQGKHRKSFA
LDSEGEEASQESSSEKDEEDDEEDEEDEDREEAVDTTKKETEVSDGEDEESDSSSKCSLY
ADSDGENDSTSDSESSSSSSSSSSSSSSSSSSSSSSSSESSSEDEEEEERPAALPSASPP
PREVPVPTPAPVEVPVPERVAGSPVTPLPEQEASPARPAGPTEESPPSAPLRPPEPPAGP
PAPAPRPDERPSSPIPLLPPPKKRRKTVSFSAIEVVPAPEPPPATPPQAKFPGPASRKAP
RGVERTIRNLPLDHASLVKSWPEEVSRGGRSRAGGRGRLTEEEEAEPGTEVDLAVLADLA
LTPARRGLPALPAVEDSEATETSDEAERPRPLLSHILLEHNYALAVKPTPPAPALRPPEP
VPAPAALFSSPADEVLEAPEVVVAEAEEPKPQQLQQQREEGEEEGEEEGEEEEEESSDSS
SSSDGEGALRRRSLRSHARRRRPPPPPPPPPPRAYEPRSEFEQMTILYDIWNSGLDSEDM
SYLRLTYERLLQQTSGADWLNDTHWVHHTITNLTTPKRKRRPQDGPREHQTGSARSEGYY
PISKKEKDKYLDVCPVSARQLEGVDTQGTNRVLSERRSEQRRLLSAIGTSAIMDSDLLKL
NQ
LKFRKKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIG
SSYLFRVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEI
TYDYK
FPLEDNKIPCLCGTESCRGSLN
Sequence length 1707
Interactions View interactions
Pathways Pathway information has different metabolic/signaling pathways associated with genes.
  KEGG   Reactome
  Lysine degradation
Metabolic pathways
  PKMTs methylate histone lysines
RUNX1 regulates genes involved in megakaryocyte differentiation and platelet function
Associated diseases Disease information provided by ClinVar, GenCC, and GWAS databases.
Causal
Disease term Disease name dbSNP ID References
Epilepsy Epilepsy rs113994140, rs28937874, rs1589762127, rs104894166, rs28939075, rs2134001459, rs104894167, rs119488099, rs119488100, rs387906420, rs121917752, rs121918622, rs281865564, rs387907313, rs397514670
View all (165 more)
31197650
Mental retardation Intellectual Disability rs5742905, rs267607136, rs267607137, rs2131714307, rs267607038, rs267607042, rs80338685, rs137853127, rs80338815, rs28940893, rs387906309, rs121908096, rs121908099, rs587784365, rs121918315
View all (1024 more)
26974950
Psoriasis Psoriasis rs281875215, rs587777763, rs281875213, rs281875212 26974007
Schizophrenia Schizophrenia rs13447324, rs387906932, rs387906933, rs863223354, rs863223355, rs776061422, rs863223349, rs748809996, rs759748655, rs863223353, rs863223350, rs863223356, rs781821239, rs863223348, rs863223346
View all (12 more)
26974950, 24853937
Unknown
Disease term Disease name Evidence References Source
Crohn disease Crohn Disease 26974007 ClinVar
Multiple Sclerosis Multiple Sclerosis GWAS
Associations from Text Mining
Disease Name Relationship Type References
Apraxias Associate 29463886, 36117209, 40225914
Arthritis Psoriatic Associate 20862685
Arthritis Rheumatoid Associate 35479077
Autistic Disorder Associate 26938441
Breast Neoplasms Associate 30191958, 30990809, 31253781, 33223203, 35966598
Breast Neoplasms Stimulate 31897900
Carcinogenesis Associate 36707804
Carcinoma Hepatocellular Associate 27656834, 32918976, 37581938
Carcinoma Non Small Cell Lung Associate 37548102
Carcinoma Pancreatic Ductal Stimulate 36271761