Gene Gene information from NCBI Gene database.
Entrez ID 6657
Gene name SRY-box transcription factor 2
Gene symbol SOX2
Synonyms (NCBI Gene)
ANOP3MCOPS3
Chromosome 3
Chromosome location 3q26.33
Summary This intronless gene encodes a member of the SRY-related HMG-box (SOX) family of transcription factors involved in the regulation of embryonic development and in the determination of cell fate. The product of this gene is required for stem-cell maintenanc
miRNA miRNA information provided by mirtarbase database.
77
miRTarBase ID miRNA Experiments Reference
MIRT005370 hsa-miR-126-3p ImmunohistochemistryLuciferase reporter assayMicroarrayqRT-PCRWestern blot 21304604
MIRT005370 hsa-miR-126-3p ImmunohistochemistryLuciferase reporter assayMicroarrayqRT-PCRWestern blot 21304604
MIRT005692 hsa-miR-522-3p Luciferase reporter assayWestern blot 21304604
MIRT000307 hsa-miR-145-5p FACSFlowGFP reporter assayIn situ hybridizationLuciferase reporter assayqRT-PCR 19409607
MIRT000307 hsa-miR-145-5p FACSFlowGFP reporter assayIn situ hybridizationLuciferase reporter assayqRT-PCR 19409607
Transcription factors Transcription factors information provided by TRRUST V2 database.
3
Transcription factor Regulation Reference
ID4 Unknown 23613880
KDM2A Unknown 23872478
POU5F1 Activation 17068183
Gene ontology (GO) Gene Ontology (GO) annotations describing the biological processes, molecular functions, and cellular components associated with a gene.
68
GO ID Ontology Definition Evidence Reference
GO:0000122 Process Negative regulation of transcription by RNA polymerase II IBA
GO:0000122 Process Negative regulation of transcription by RNA polymerase II ISS
GO:0000785 Component Chromatin ISA
GO:0000976 Function Transcription cis-regulatory region binding IDA 18407919
GO:0000976 Function Transcription cis-regulatory region binding IEA
Other IDs Other IDs provides unique identifiers for this gene in OMIM, HGNC, and Ensembl databases.
MIM HGNC e!Ensembl
184429 11195 ENSG00000181449
Protein Protein information from UniProt database.
UniProt ID Unique identifier for the protein in the UniProt database. Click to view detailed protein information.
P48431
Protein name Transcription factor SOX-2
Protein function Transcription factor that forms a trimeric complex with OCT4 on DNA and controls the expression of a number of genes involved in embryonic development such as YES1, FGF4, UTF1 and ZFP206 (By similarity). Binds to the proximal enhancer region of
PDB 1O4X , 2LE4 , 6T7B , 6T90 , 6WX7 , 6WX8 , 6WX9 , 6YOV
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF00505 HMG_box 41 109 HMG (high mobility group) box Domain
PF12336 SOXp 110 200 SOX transcription factor Family
Sequence
MYNMMETELKPPGPQQTSGGGGGNSTAAAAGGNQKNSPDRVKRPMNAFMVWSRGQRRKMA
QENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYK
YRPRRKTKTLM
KKDKYTLPGGLLAPGGNSMASGVGVGAGLGAGVNQRMDSYAHMNGWSNGSYSMMQDQLGY
PQHPGLNAHGAAQMQPMHRY
DVSALQYNSMTSSQTYMNGSPTYSMSYSQQGTPGMALGSM
GSVVKSEASSSPPVVTSSSHSRAPCQAGDLRDMISMYLPGAEVPEPAAPSRLHMSQHYQS
GPVPGTAINGTLPLSHM
Sequence length 317
Interactions View interactions
Pathways Pathway information has different metabolic/signaling pathways associated with genes.
  KEGG  Reactome
  Hippo signaling pathway
Signaling pathways regulating pluripotency of stem cells
  POU5F1 (OCT4), SOX2, NANOG repress genes related to differentiation
POU5F1 (OCT4), SOX2, NANOG activate genes related to proliferation
Deactivation of the beta-catenin transactivating complex
Transcriptional regulation of pluripotent stem cells
Interleukin-4 and Interleukin-13 signaling
Associated diseases Disease associations from ClinVar (causal & non-causal) and other databases (OMIM, Orphanet, GWAS, etc.).
33
Evidence Score: ★☆☆☆☆  Gene-disease association found in Text Mining only ★★☆☆☆  Found in Text Mining and Unknown/Other Associations ★★★☆☆  Reported in Unknown/Other Associations across ≥2 Sources ★★★★☆  ClinVar: Pathogenic/Likely Pathogenic (<5 Variants) ★★★★★  ClinVar: Pathogenic/Likely Pathogenic (≥5 Variants)
Causal Diseases associated with Pathogenic or Likely Pathogenic variants in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession Evidence Score
Anophthalmia Pathogenic rs2108522679, rs2473717674 RCV002291321
RCV002291331
★★★★☆
ClinVar: Pathogenic / Likely Pathogenic (<5 Variants)
Anophthalmia/microphthalmia-esophageal atresia syndrome Pathogenic; Likely pathogenic rs398123693, rs1714847764, rs2108522652, rs2108521702, rs2108522196, rs2108522646, rs2108522776, rs2108523389, rs2108522189, rs2473717602, rs2473718812, rs2473718342, rs398122803, rs55683010, rs2108522980
View all (48 more)
RCV001067610
RCV001342057
RCV001375935
RCV001388674
RCV001706841
View all (62 more)
★★★★★
ClinVar: Pathogenic / Likely Pathogenic (≥5 Variants)
Chorioretinal coloboma Likely pathogenic rs2108521642 RCV002291354
★★★★☆
ClinVar: Pathogenic / Likely Pathogenic (<5 Variants)
Developmental disorder Likely pathogenic; Pathogenic rs1249553271 RCV001843562
★★★★☆
ClinVar: Pathogenic / Likely Pathogenic (<5 Variants)
Unknown / Other Associations ClinVar entries with uncertain/conflicting evidence, and associations from other databases (OMIM, Orphanet, GWAS, etc.) where the gene is not established as causal.
Phenotype Name Clinical Significance Source Evidence Score
Amenorrhea Conflicting classifications of pathogenicity ClinVar
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
ANIRIDIA Disgenet
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
ANOPHTHALMOS CTD, Disgenet
CTD, Disgenet
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
ATAXIA CTD
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
Associations from Text Mining Disease associations identified through text mining
Disease Name Disease (Merged) Source PMID Relationship Type Evidence Score
Accessory rib Accessory Rib HPO_DG
★☆☆☆☆
Found in Text Mining only
Adamantinous Craniopharyngioma Adamantinous Craniopharyngioma BEFREE 29180744
★☆☆☆☆
Found in Text Mining only
Adenocarcinoma Adenocarcinoma BEFREE 15910596, 20161759, 21334718, 21460799, 21687954, 22615765, 23086772, 23544055, 24481417, 24736592, 27766003, 28692180, 30405752, 31447265, 31610800
★☆☆☆☆
Found in Text Mining only
Adenocarcinoma of ampulla of Vater Adenocarcinoma BEFREE 16596179
★☆☆☆☆
Found in Text Mining only
Adenocarcinoma Of Esophagus Esophageal Cancer BEFREE 28059963, 28692180
★☆☆☆☆
Found in Text Mining only
Adenocarcinoma of large intestine Colorectal Cancer BEFREE 23599173
★☆☆☆☆
Found in Text Mining only
Adenocarcinoma of lung (disorder) Lung adenocarcinoma BEFREE 21931300, 24746758, 26780934, 28259951, 28737489, 29596469, 31059077
★☆☆☆☆
Found in Text Mining only
Adenocarcinoma of lung (disorder) Lung adenocarcinoma CTD_human_DG 25184679
★☆☆☆☆
Found in Text Mining only
Adenocarcinoma of prostate Prostate adenocarcinoma BEFREE 31772313
★☆☆☆☆
Found in Text Mining only
Adenoid Cystic Carcinoma Adenocarcinoma BEFREE 24828201, 29596469, 31113740
★☆☆☆☆
Found in Text Mining only