Gene
Entrez ID Entrez Gene ID - the GENE ID in NCBI Gene database.
23013
Gene name Gene Name - the full gene name approved by the HGNC.
Spen family transcriptional repressor
Gene symbol Gene Symbol - the official gene symbol approved by the HGNC.
SPEN
Synonyms (NCBI Gene) Gene synonyms aliases
HIAA0929, MINT, RATARS, RBM15C, SHARP
Disease Acronyms (UniProt) Disease acronyms from UniProt database
RATARS
Chromosome Chromosome number
1
Chromosome location Chromosomal Location - indicates the cytogenetic location of the gene or region on the chromosome.
1p36.21-p36.13
Summary Summary of gene provided in NCBI Entrez Gene.
This gene encodes a hormone inducible transcriptional repressor. Repression of transcription by this gene product can occur through interactions with other repressors, by the recruitment of proteins involved in histone deacetylation, or through sequestrat
miRNA miRNA information provided by mirtarbase database.
miRTarBase ID miRNA Experiments Reference
MIRT019521 hsa-miR-151a-5p Sequencing 20371350
MIRT051293 hsa-miR-16-5p CLASH 23622248
MIRT049048 hsa-miR-92a-3p CLASH 23622248
MIRT048159 hsa-miR-196a-5p CLASH 23622248
MIRT046031 hsa-miR-125b-5p CLASH 23622248
Gene ontology (GO) Gene ontology information of associated ontologies with gene provided by GO database.
GO ID Ontology Definition Evidence Reference
GO:0000122 Process Negative regulation of transcription by RNA polymerase II IDA 16287852
GO:0000398 Process MRNA splicing, via spliceosome IBA 21873635
GO:0001085 Function RNA polymerase II transcription factor binding IPI 16287852
GO:0003676 Function Nucleic acid binding IBA 21873635
GO:0003677 Function DNA binding IEA
Other IDs Other ids provides unique ids of gene in databases such as OMIM, HGNC, ENSEMBLE.
MIM HGNC e!Ensembl
613484 17575 ENSG00000065526
Protein
UniProt ID Q96T58
Protein name Msx2-interacting protein (SMART/HDAC1-associated repressor protein) (SPEN homolog)
Protein function May serve as a nuclear matrix platform that organizes and integrates transcriptional responses. In osteoblasts, supports transcription activation: synergizes with RUNX2 to enhance FGFR2-mediated activation of the osteocalcin FGF-responsive eleme
PDB 1OW1 , 2RT5 , 4P6Q , 7Z1K
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF00076 RRM_1 8 75 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain) Domain
PF00076 RRM_1 337 407 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain) Domain
PF00076 RRM_1 440 507 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain) Domain
PF00076 RRM_1 519 583 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain) Domain
PF07744 SPOC 3499 3664 SPOC domain Domain
Tissue specificity TISSUE SPECIFICITY: Expressed at high level in brain, testis, spleen and thymus. Expressed at intermediate level in kidney, liver, mammary gland and skin.
Sequence
MVRETRHLWVGNLPENVREEKIIEHFKRYGRVESVKILPKRGSEGGVAAFVDFVDIKSAQ
KAHNSVNKMGDRDLR
TDYNEPGTIPSAARGLDDTVSIASRSREVSGFRGGGGGPAYGPPP
SLHAREGRYERRLDGASDNRERAYEHSAYGHHERGTGGFDRTRHYDQDYYRDPRERTLQH
GLYYASRSRSPNRFDAHDPRYEPRAREQFTLPSVVHRDIYRDDITREVRGRRPERNYQHS
RSRSPHSSQSRNQSPQRLASQASRPTRSPSGSGSRSRSSSSDSISSSSSTSSDSSDSSSS
SSDDSPARSVQSAAVPAPTSQLLSSLEKDEPRKSFGIKVQNLPVRSTDTSLKDGLFHEFK
KFGKVTSVQIHGTSEERYGLVFFRQQEDQEKALTASKGKLFFGMQIE
VTAWIGPETESEN
EFRPLDERIDEFHPKATRTLFIGNLEKTTTYHDLRNIFQRFGEIVDIDIKKVNGVPQYAF
LQYCDIASVCKAIKKMDGEYLGNNRLK
LGFGKSMPTNCVWLDGLSSNVSDQYLTRHFCRY
GPVVKVVFDRLKGMALVLYNEIEYAQAAVKETKGRKIGGNKIK
VDFANRESQLAFYHCME
KSGQDIRDFYEMLAERREERRASYDYNQDRTYYESVRTPGTYPEDSRRDYPARGREFYSE
WETYQGDYYESRYYDDPREYRDYRNDPYEQDIREYSYRQRERERERERFESDRDRDHERR
PIERSQSPVHLRRPQSPGASPSQAERLPSDSERRLYSRSSDRSGSCSSLSPPRYEKLDKS
RLERYTKNEKTDKERTFDPERVERERRLIRKEKVEKDKTDKQKRKGKVHSPSSQSSETDQ
ENEREQSPEKPRSCNKLSREKADKEGIAKNRLELMPCVVLTRVKEKEGKVIDHTPVEKLK
AKLDNDTVKSSALDQKLQVSQTEPAKSDLSKLESVRMKVPKEKGLSSHVEVVEKEGRLKA
RKHLKPEQPADGVSAVDLEKLEARKRRFADSNLKAEKQKPEVKKSSPEMEDARVLSKKQP
DVSSREVILLREGEAERKPVRKEILKRESKKIKLDRLNTVASPKDCQELASISVGSGSRP
SSDLQARLGELAGESVENQEVQSKKPIPSKPQLKQLQVLDDQGPEREDVRKNYCSLRDET
PERKSGQEKSHSVNTEEKIGIDIDHTQSYRKQMEQSRRKQQMEMEIAKSEKFGSPKKDVD
EYERRSLVHEVGKPPQDVTDDSPPSKKKRMDHVDFDICTKRERNYRSSRQISEDSERTGG
SPSVRHGSFHEDEDPIGSPRLLSVKGSPKVDEKVLPYSNITVREESLKFNPYDSSRREQM
ADMAKIKLSVLNSEDELNRWDSQMKQDAGRFDVSFPNSIIKRDSLRKRSVRDLEPGEVPS
DSDEDGEHKSHSPRASALYESSRLSFLLRDREDKLRERDERLSSSLERNKFYSFALDKTI
TPDTKALLERAKSLSSSREENWSFLDWDSRFANFRNNKDKEKVDSAPRPIPSWYMKKKKI
RTDSEGKMDDKKEDHKEEEQERQELFASRFLHSSIFEQDSKRLQHLERKEEDSDFISGRI
YGKQTSEGANSTTDSIQEPVVLFHSRFMELTRMQQKEKEKDQKPKEVEKQEDTENHPKTP
ESAPENKDSELKTPPSVGPPSVTVVTLESAPSALEKTTGDKTVEAPLVTEEKTVEPATVS
EEAKPASEPAPAPVEQLEQVDLPPGADPDKEAAMMPAGVEEGSSGDQPPYLDAKPPTPGA
SFSQAESNVDPEPDSTQPLSKPAQKSEEANEPKAEKPDATADAEPDANQKAEAAPESQPP
ASEDLEVDPPVAAKDKKPNKSKRSKTPVQAAAVSIVEKPVTRKSERIDREKLKRSNSPRG
EAQKLLELKMEAEKITRTASKNSAADLEHPEPSLPLSRTRRRNVRSVYATMGDHENRSPV
KEPVEQPRVTRKRLERELQEAAAVPTTPRRGRPPKTRRRADEEEENEAKEPAETLKPPEG
WRSPRSQKTAAGGGPQGKKGKNEPKVDATRPEATTEVGPQIGVKESSMEPKAAEEEAGSE
QKRDRKDAGTDKNPPETAPVEVVEKKPAPEKNSKSKRGRSRNSRLAVDKSASLKNVDAAV
SPRGAAAQAGERESGVVAVSPEKSESPQKEDGLSSQLKSDPVDPDKEPEKEDVSASGPSP
EATQLAKQMELEQAVEHIAKLAEASASAAYKADAPEGLAPEDRDKPAHQASETELAAAIG
SIINDISGEPENFPAPPPYPGESQTDLQPPAGAQALQPSEEGMETDEAVSGILETEAATE
SSRPPVNAPDPSAGPTDTKEARGNSSETSHSVPEAKGSKEVEVTLVRKDKGRQKTTRSRR
KRNTNKKVVAPVESHVPESNQAQGESPAANEGTTVQHPEAPQEEKQSEKPHSTPPQSCTS
DLSKIPSTENSSQEISVEERTPTKASVPPDLPPPPQPAPVDEEPQARFRVHSIIESDPVT
PPSDPSIPIPTLPSVTAAKLSPPVASGGIPHQSPPTKVTEWITRQEEPRAQSTPSPALPP
DTKASDVDTSSSTLRKILMDPKYVSATSVTSTSVTTAIAEPVSAAPCLHEAPPPPVDSKK
PLEEKTAPPVTNNSEIQASEVLVAADKEKVAPVIAPKITSVISRMPVSIDLENSQKITLA
KPAPQTLTGLVSALTGLVNVSLVPVNALKGPVKGSVTTLKSLVSTPAGPVNVLKGPVNVL
TGPVNVLTTPVNATVGTVNAAPGTVNAAASAVNATASAVTVTAGAVTAASGGVTATTGTV
TMAGAVIAPSTKCKQRASANENSRFHPGSMPVIDDRPADAGSGAGLRVNTSEGVVLLSYS
GQKTEGPQRISAKISQIPPASAMDIEFQQSVSKSQVKPDSVTASQPPSKGPQAPAGYANV
ATHSTLVLTAQTYNASPVISSVKADRPSLEKPEPIHLSVSTPVTQGGTVKVLTQGINTPP
VLVHNQLVLTPSIVTTNKKLADPVTLKIETKVLQPANLGSTLTPHHPPALPSKLPTEVNH
VPSGPSIPADRTVSHLAAAKLDAHSPRPSGPGPSSFPRASHPSSTASTALSTNATVMLAA
GIPVPQFISSIHPEQSVIMPPHSITQTVSLSHLSQGEVRMNTPTLPSITYSIRPEALHSP
RAPLQPQQIEVRAPQRASTPQPAPAGVPALASQHPPEEEVHYHLPVARATAPVQSEVLVM
QSEYRLHPYTVPRDVRIMVHPHVTAVSEQPRAADGVVKVPPASKAPQQPGKEAAKTPDAK
AAPTPTPAPVPVPVPLPAPAPAPHGEARILTVTPSNQLQGLPLTPPVVVTHGVQIVHSSG
ELFQEYRYGDIRTYHPPAQLTHTQFPAASSVGLPSRTKTAAQGPPPEGEPLQPPQPVQST
QPAQPAPPCPPSQLGQPGQPPSSKMPQVSQEAKGTQTGVEQPRLPAGPANRPPEPHTQVQ
RAQAETGPTSFPSPVSVSMKPDLPVSLPTQTAPKQPLFVPTTSGPSTPPGLVLPHTEFQP
APKQDSSPHLTSQRPVDMVQLLKKYPIVWQGLLALKNDTAAVQLHFVSGNNVLAHRSLPL
SEGGPPLRIAQRMRLEATQLEGVARRMTVETDYCLLLALPCGRDQEDVVSQTESLKAAFI
TYLQAKQAAGIINVPNPGSNQPAYVLQIFPPCEFSESHLSRLAPDLLASISNISPHLMIV
IASV
Sequence length 3664
Interactions View interactions
Pathways Pathway information has different metabolic/signaling pathways associated with genes.
  KEGG  
  Notch signaling pathway  
Associated diseases Disease information provided by ClinVar, GenCC, and GWAS databases.
Causal
Disease term Disease name dbSNP ID References
Breast cancer Malignant neoplasm of breast rs587776547, rs1137887, rs137853007, rs587776650, rs80359351, rs80359714, rs121917783, rs104886456, rs121964878, rs80359874, rs80357868, rs80357508, rs387906843, rs80357569, rs80358158
View all (309 more)
Diffuse lymphoma Diffuse Large B-Cell Lymphoma rs121912651, rs121913289, rs121913293, rs878854402, rs869025340, rs1349928568, rs1569115687, rs121913291
Non-hodgkin lymphoma Lymphoma, Non-Hodgkin rs121908689, rs28936699, rs121909775, rs398122800, rs121913357, rs121913355, rs398122820
Prostate cancer Malignant neoplasm of prostate rs121909139, rs121909140, rs121909141, rs121909142, rs121909143, rs606231169, rs606231170, rs137852584, rs137852578, rs137852580, rs137852581, rs137852582 29610475
Unknown
Disease term Disease name Evidence References Source
Neurodevelopmental Disorders complex neurodevelopmental disorder GenCC
Psoriasis Psoriasis GWAS
Alzheimer disease Alzheimer disease GWAS
Atrial Fibrillation Atrial Fibrillation GWAS
Associations from Text Mining
Disease Name Relationship Type References
Alzheimer Disease Associate 22355143
Attention Deficit Disorder with Hyperactivity Associate 33596411
Autism Spectrum Disorder Associate 33596411
Breast Neoplasms Associate 32641685
Carcinogenesis Associate 37620924
Carcinoma Adenoid Cystic Associate 23778135, 23778141
Carcinoma Adenosquamous Associate 31958074
Carcinoma Hepatocellular Associate 34288818
Cardiovascular Abnormalities Associate 24454898
Chromosome 1p36 Deletion Syndrome Inhibit 33596411