Gene
Entrez ID Entrez Gene ID - the GENE ID in NCBI Gene database.
8189
Gene name Gene Name - the full gene name approved by the HGNC.
Symplekin scaffold protein
Gene symbol Gene Symbol - the official gene symbol approved by the HGNC.
SYMPK
Synonyms (NCBI Gene) Gene synonyms aliases
Pta1, SPK, SYM
Chromosome Chromosome number
19
Chromosome location Chromosomal Location - indicates the cytogenetic location of the gene or region on the chromosome.
19q13.32
Summary Summary of gene provided in NCBI Entrez Gene.
This gene encodes a nuclear protein that functions in the regulation of polyadenylation and promotes gene expression. The protein forms a high-molecular weight complex with components of the polyadenylation machinery. It is thought to serve as a scaffold
miRNA miRNA information provided by mirtarbase database.
miRTarBase ID miRNA Experiments Reference
MIRT023496 hsa-miR-1-3p Proteomics 18668040
MIRT047571 hsa-miR-10a-5p CLASH 23622248
MIRT041533 hsa-miR-193b-3p CLASH 23622248
MIRT040725 hsa-miR-92b-3p CLASH 23622248
MIRT037562 hsa-miR-744-5p CLASH 23622248
Transcription factors
Transcription factor Regulation Reference
RUNX1 Repression 19328795
Gene ontology (GO) Gene ontology information of associated ontologies with gene provided by GO database.
GO ID Ontology Definition Evidence Reference
GO:0005515 Function Protein binding IPI 14707147, 18288197, 18688255, 20861839, 26496610, 32814053, 33961781, 34819669, 35177536
GO:0005634 Component Nucleus IEA
GO:0005654 Component Nucleoplasm IDA 8769423
GO:0005654 Component Nucleoplasm IEA
GO:0005654 Component Nucleoplasm TAS
Other IDs Other ids provides unique ids of gene in databases such as OMIM, HGNC, ENSEMBLE.
MIM HGNC e!Ensembl
602388 22935 ENSG00000125755
Protein
UniProt ID Q92797
Protein name Symplekin
Protein function Scaffold protein that functions as a component of a multimolecular complex involved in histone mRNA 3'-end processing. Specific component of the tight junction (TJ) plaque, but might not be an exclusively junctional component. May have a house-k
PDB 3O2Q , 3O2S , 3O2T , 3ODR , 3ODS , 4H3H , 4H3K , 6V4X
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF11935 DUF3453 122 349 Domain of unknown function (DUF3453) Family
PF12295 Symplekin_C 887 1068 Symplekin tight junction protein C terminal Family
Tissue specificity TISSUE SPECIFICITY: In testis, expressed in polar epithelia and Sertoli cells but not in vascular endothelia. The protein is detected in stomach, duodenum, pancreas, liver, fetal brain, carcinomas, lens-forming cells, fibroblasts, lymphocytes, lymphoma ce
Sequence
MASGSGDSVTRRSVASQFFTQEEGPGIDGMTTSERVVDLLNQAALITNDSKITVLKQVQE
LIINKDPTLLDNFLDEIIAFQADKSIEVRKFVIGFIEEACKRDIELLLKLIANLNMLLRD
ENVNVVKKAILTMTQLYKVALQWMVKSRVISELQEACWDMVSAMAGDIILLLDSDNDGIR
THAIKFVEGLIVTLSPRMADSEIPRRQEHDISLDRIPRDHPYIQYNVLWEEGKAALEQLL
KFMVHPAISSINLTTALGSLANIARQRPMFMSEVIQAYETLHANLPPTLAKSQVSSVRKN
LKLHLLSVLKHPASLEFQAQITTLLVDLGTPQAEIARNMPSSKDTRKRP
RDDSDSTLKKM
KLEPNLGEDDEDKDLEPGPSGTSKASAQISGQSDTDITAEFLQPLLTPDNVANLVLISMV
YLPEAMPASFQAIYTPVESAGTEAQIKHLARLMATQMTAAGLGPGVEQTKQCKEEPKEEK
VVKTESVLIKRRLSAQGQAISVVGSLSSMSPLEEEAPQAKRRPEPIIPVTQPRLAGAGGR
KKIFRLSDVLKPLTDAQVEAMKLGAVKRILRAEKAVACSGAAQVRIKILASLVTQFNSGL
KAEVLSFILEDVRARLDLAFAWLYQEYNAYLAAGASGSLDKYEDCLIRLLSGLQEKPDQK
DGIFTKVVLEAPLITESALEVVRKYCEDESRTYLGMSTLRDLIFKRPSRQFQYLHVLLDL
SSHEKDKVRSQALLFIKRMYEKEQLREYVEKFALNYLQLLVHPNPPSVLFGADKDTEVAA
PWTEETVKQCLYLYLALLPQNHKLIHELAAVYTEAIADIKRTVLRVIEQPIRGMGMNSPE
LLLLVENCPKGAETLVTRCLHSLTDKVPPSPELVKRVRDLYHKRLPDVRFLIPVLNGLEK
KEVIQALPKLIKLNPIVVKEVFNRLLGTQHGEGNSALSPLNPGELLIALHNIDSVKCDMK
SIIKATNLCFAERNVYTSEVLAVVMQQLMEQSPLPMLLMRTVIQSLTMYPRLGGFVMNIL
SRLIMKQVWKYPKVWEGFIKCCQRTKPQSFQVILQLPPQQLGAVFDKC
PELREPLLAHVR
SFTPHQQAHIPNSIMTILEASGKQEPEAKEAPAGPLEEDDLEPLTLAPAPAPRPPQDLIG
LRLAQEKALKRQLEEEQKLKPGGVGAPSSSSPSPSPSARPGPPPSEEAMDFREEGPECET
PGIFISMDDDSGLTEAALLDSSLEGPLPKETAAGGLTLKEERSPQTLAPVGEDAMKTPSP
AAEDAREPEAKGNS
Sequence length 1274
Interactions View interactions
Pathways Pathway information has different metabolic/signaling pathways associated with genes.
  KEGG   Reactome
  mRNA surveillance pathway
Tight junction
  Transport of Mature mRNA Derived from an Intronless Transcript
mRNA Splicing - Major Pathway
mRNA 3'-end processing
RNA Polymerase II Transcription Termination
Processing of Intronless Pre-mRNAs
<
Associated diseases Disease associations categorized as Causal (pathogenic variants), Unknown (uncertain genetic evidence), or Text Mining (literature-based associations)
Unknown Includes: (1) ClinVar NON-pathogenic variants (Uncertain, Benign, Conflicting, VUS), (2) GenCC associations, (3) GWAS associations, (4) CBGDA evidence-based associations. NOTE: Diseases with pathogenic evidence are excluded to avoid conflicts.
Disease merge term Disease name Evidence References Source
Alzheimer disease Alzheimer's disease or gastroesophageal reflux disease N/A N/A GWAS
Asthma Asthma N/A N/A GWAS
Cholelithiasis Cholelithiasis N/A N/A GWAS
Colorectal Cancer Colorectal cancer N/A N/A GWAS
Associations from Text Mining Disease associations identified through Pubtator
Disease Name Relationship Type References
Breast Neoplasms Associate 27526934
Carcinogenesis Associate 28630428
Colorectal Neoplasms Associate 28295283, 28630428
Thyroid Cancer Papillary Associate 33110161