Gene Gene information from NCBI Gene database.
Entrez ID 9739
Gene name SET domain containing 1A, histone lysine methyltransferase
Gene symbol SETD1A
Synonyms (NCBI Gene)
EPEDDEPEO2KMT2FNEDSIDSet1Set1A
Chromosome 16
Chromosome location 16p11.2
Summary The protein encoded by this gene is a component of a histone methyltransferase (HMT) complex that produces mono-, di-, and trimethylated histone H3 at Lys4. Trimethylation of histone H3 at lysine 4 (H3K4me3) is a chromatin modification known to generally
SNPs SNP information provided by dbSNP.
6
SNP ID Visualize variation Clinical significance Consequence
rs770913157 C>G,T Pathogenic Coding sequence variant, stop gained, synonymous variant
rs869312829 A>G Pathogenic Splice acceptor variant
rs869312830 C>- Pathogenic Frameshift variant, coding sequence variant
rs869312831 C>T Pathogenic Stop gained, coding sequence variant
rs869312832 ->C Pathogenic Frameshift variant, coding sequence variant
miRNA miRNA information provided by mirtarbase database.
202
miRTarBase ID miRNA Experiments Reference
MIRT051709 hsa-let-7e-5p CLASH 23622248
MIRT044797 hsa-miR-320a CLASH 23622248
MIRT044067 hsa-miR-361-5p CLASH 23622248
MIRT040819 hsa-miR-18a-3p CLASH 23622248
MIRT036712 hsa-miR-760 CLASH 23622248
Gene ontology (GO) Gene Ontology (GO) annotations describing the biological processes, molecular functions, and cellular components associated with a gene.
37
GO ID Ontology Definition Evidence Reference
GO:0000785 Component Chromatin IDA 27141965
GO:0003676 Function Nucleic acid binding IEA
GO:0003723 Function RNA binding IDA 38003223
GO:0003723 Function RNA binding IEA
GO:0005515 Function Protein binding IPI 12670868, 17998332, 18765639, 20622854, 22266653, 22665483, 22723415, 22843687, 23870121, 26886794, 33961781, 37030068
Other IDs Other IDs provides unique identifiers for this gene in OMIM, HGNC, and Ensembl databases.
MIM HGNC e!Ensembl
611052 29010 ENSG00000099381
Protein Protein information from UniProt database.
UniProt ID Unique identifier for the protein in the UniProt database. Click to view detailed protein information.
O15047
Protein name Histone-lysine N-methyltransferase SETD1A (EC 2.1.1.364) (Lysine N-methyltransferase 2F) (SET domain-containing protein 1A) (hSET1A) (Set1/Ash2 histone methyltransferase complex subunit SET1)
Protein function Histone methyltransferase that catalyzes methyl group transfer from S-adenosyl-L-methionine to the epsilon-amino group of 'Lys-4' of histone H3 (H3K4) via a non-processive mechanism (PubMed:12670868, PubMed:25561738). Part of chromatin remodelin
PDB 3S8S , 3UVN , 4EWR , 8ILY
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF00076 RRM_1 99 166 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain) Domain
PF11764 N-SET 1425 1562 COMPASS (Complex proteins associated with Set1p) component N Domain
PF00856 SET 1579 1685 SET domain Family
Sequence
MDQEGGGDGQKAPSFQWRNYKLIVDPALDPALRRPSQKVYRYDGVHFSVNDSKYIPVEDL
QDPRCHVRSKNRDFSLPVPKFKLDEFYIGQIPLKEVTFARLNDNVRETFLKDMCRKYGEV
EEVEILLHPRTRKHLGLARVLFTSTRGAKETVKNLHLTSVMGNIIH
AQLDIKGQQRMKYY
ELIVNGSYTPQTVPTGGKALSEKFQGSGAATETAESRRRSSSDTAAYPAGTTAVGTPGNG
TPCSQDTSFSSSRQDTPSSFGQFTPQSSQGTPYTSRGSTPYSQDSAYSSSTTSTSFKPRR
SENSYQDAFSRRHFSASSASTTASTAIAATTAATASSSASSSSLSSSSSSSSSSSSSQFR
SSDANYPAYYESWNRYQRHTSYPPRRATREEPPGAPFAENTAERFPPSYTSYLPPEPSRP
TDQDYRPPASEAPPPEPPEPGGGGGGGGPSPEREEVRTSPRPASPARSGSPAPETTNESV
PFAQHSSLDSRIEMLLKEQRSKFSFLASDTEEEEENSSMVLGARDTGSEVPSGSGHGPCT
PPPAPANFEDVAPTGSGEPGATRESPKANGQNQASPCSSGDDMEISDDDRGGSPPPAPTP
PQQPPPPPPPPPPPPPYLASLPLGYPPHQPAYLLPPRPDGPPPPEYPPPPPPPPHIYDFV
NSLELMDRLGAQWGGMPMSFQMQTQMLTRLHQLRQGKGLIAASAGPPGGAFGEAFLPFPP
PQEAAYGLPYALYAQGQEGRGAYSREAYHLPMPMAAEPLPSSSVSGEEARLPPREEAELA
EGKTLPTAGTVGRVLAMLVQEMKSIMQRDLNRKMVENVAFGAFDQWWESKEEKAKPFQNA
AKQQAKEEDKEKTKLKEPGLLSLVDWAKSGGTTGIEAFAFGSGLRGALRLPSFKVKRKEP
SEISEASEEKRPRPSTPAEEDEDDPEQEKEAGEPGRPGTKPPKRDEERGKTQGKHRKSFA
LDSEGEEASQESSSEKDEEDDEEDEEDEDREEAVDTTKKETEVSDGEDEESDSSSKCSLY
ADSDGENDSTSDSESSSSSSSSSSSSSSSSSSSSSSSSESSSEDEEEEERPAALPSASPP
PREVPVPTPAPVEVPVPERVAGSPVTPLPEQEASPARPAGPTEESPPSAPLRPPEPPAGP
PAPAPRPDERPSSPIPLLPPPKKRRKTVSFSAIEVVPAPEPPPATPPQAKFPGPASRKAP
RGVERTIRNLPLDHASLVKSWPEEVSRGGRSRAGGRGRLTEEEEAEPGTEVDLAVLADLA
LTPARRGLPALPAVEDSEATETSDEAERPRPLLSHILLEHNYALAVKPTPPAPALRPPEP
VPAPAALFSSPADEVLEAPEVVVAEAEEPKPQQLQQQREEGEEEGEEEGEEEEEESSDSS
SSSDGEGALRRRSLRSHARRRRPPPPPPPPPPRAYEPRSEFEQMTILYDIWNSGLDSEDM
SYLRLTYERLLQQTSGADWLNDTHWVHHTITNLTTPKRKRRPQDGPREHQTGSARSEGYY
PISKKEKDKYLDVCPVSARQLEGVDTQGTNRVLSERRSEQRRLLSAIGTSAIMDSDLLKL
NQ
LKFRKKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQMVADMREKRYVQEGIG
SSYLFRVDHDTIIDATKCGNLARFINHCCTPNCYAKVITIESQKKIVIYSKQPIGVDEEI
TYDYK
FPLEDNKIPCLCGTESCRGSLN
Sequence length 1707
Interactions View interactions
Pathways Pathway information has different metabolic/signaling pathways associated with genes.
  KEGG  Reactome
  Lysine degradation
Metabolic pathways
  PKMTs methylate histone lysines
RUNX1 regulates genes involved in megakaryocyte differentiation and platelet function
Associated diseases Disease associations from ClinVar (causal & non-causal) and other databases (OMIM, Orphanet, GWAS, etc.).
26
Evidence Score: ★☆☆☆☆  Gene-disease association found in Text Mining only ★★☆☆☆  Found in Text Mining and Unknown/Other Associations ★★★☆☆  Reported in Unknown/Other Associations across ≥2 Sources ★★★★☆  ClinVar: Pathogenic/Likely Pathogenic (<5 Variants) ★★★★★  ClinVar: Pathogenic/Likely Pathogenic (≥5 Variants)
Causal Diseases associated with Pathogenic or Likely Pathogenic variants in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession Evidence Score
Autism spectrum disorder Likely pathogenic rs2543876108 RCV003127372
★★★★☆
ClinVar: Pathogenic / Likely Pathogenic (<5 Variants)
Epilepsy, early-onset, with or without developmental delay Likely pathogenic; Pathogenic rs2143509708, rs61744449, rs2543861155, rs2056158149, rs755127868, rs2056165149, rs2056100951, rs781482552, rs2056003750, rs2056118338 RCV002225059
RCV002289157
RCV004783056
RCV004783064
RCV001788230
View all (5 more)
★★★★★
ClinVar: Pathogenic / Likely Pathogenic (≥5 Variants)
Intellectual disability Likely pathogenic rs2056144779 RCV001257739
★★★★☆
ClinVar: Pathogenic / Likely Pathogenic (<5 Variants)
Neurodevelopmental disorder Pathogenic rs2143491920 RCV002273265
★★★★☆
ClinVar: Pathogenic / Likely Pathogenic (<5 Variants)
Unknown / Other Associations ClinVar entries with uncertain/conflicting evidence, and associations from other databases (OMIM, Orphanet, GWAS, etc.) where the gene is not established as causal.
Phenotype Name Clinical Significance Source Evidence Score
ANKYLOSING SPONDYLITIS GWAS catalog
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
AUTISM SPECTRUM DISORDERS Disgenet
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
CHRONIC OBSTRUCTIVE PULMONARY DISEASE GWAS catalog
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
CROHN'S DISEASE GWAS catalog
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
Associations from Text Mining Disease associations identified through text mining
Disease Name Disease (Merged) Source PMID Relationship Type Evidence Score
Adenocarcinoma Adenocarcinoma BEFREE 30377207
★☆☆☆☆
Found in Text Mining only
Ankylosing spondylitis Ankylosing Spondylitis GWASCAT_DG 26974007
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
Apraxias Apraxia Pubtator 29463886, 36117209, 40225914 Associate
★☆☆☆☆
Found in Text Mining only
Arthritis Psoriatic Psoriatic arthritis Pubtator 20862685 Associate
★☆☆☆☆
Found in Text Mining only
Arthritis Rheumatoid Rheumatoid arthritis Pubtator 35479077 Associate
★☆☆☆☆
Found in Text Mining only
Autism Spectrum Disorders Autism Spectrum Disorder BEFREE 26938441, 31808517
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
Autistic Disorder Autism Pubtator 26938441 Associate
★☆☆☆☆
Found in Text Mining only
Breast Carcinoma Breast Carcinoma BEFREE 30191958, 31253781
★☆☆☆☆
Found in Text Mining only
Breast Neoplasms Breast neoplasm Pubtator 30191958, 30990809, 31253781, 33223203, 35966598 Associate
★☆☆☆☆
Found in Text Mining only
Breast Neoplasms Breast neoplasm Pubtator 31897900 Stimulate
★☆☆☆☆
Found in Text Mining only