Gene Gene information from NCBI Gene database.
Entrez ID 6659
Gene name SRY-box transcription factor 4
Gene symbol SOX4
Synonyms (NCBI Gene)
CSS10EVI16IDDSDF
Chromosome 6
Chromosome location 6p22.3
Summary This intronless gene encodes a member of the SOX (SRY-related HMG-box) family of transcription factors involved in the regulation of embryonic development and in the determination of the cell fate. The encoded protein may act as a transcriptional regulato
SNPs SNP information provided by dbSNP.
4
SNP ID Visualize variation Clinical significance Consequence
rs1334099693 C>A,T Likely-pathogenic, pathogenic Missense variant, coding sequence variant, synonymous variant
rs1464282327 G>A,C Pathogenic Coding sequence variant, missense variant
rs1582601669 T>G Pathogenic Missense variant, coding sequence variant
rs1582601747 G>T Pathogenic Missense variant, coding sequence variant
miRNA miRNA information provided by mirtarbase database.
2272
miRTarBase ID miRNA Experiments Reference
MIRT002431 hsa-miR-335-5p Review 19935707
MIRT004043 hsa-miR-129-5p Review 20026422
MIRT004691 hsa-miR-191-5p ImmunohistochemistryImmunoprecipitaionLuciferase reporter assayMicroarrayqRT-PCR 20924108
MIRT004043 hsa-miR-129-5p In situ hybridizationMicroarrayqRT-PCR 19487295
MIRT004043 hsa-miR-129-5p In situ hybridizationMicroarrayqRT-PCR 19487295
Transcription factors Transcription factors information provided by TRRUST V2 database.
2
Transcription factor Regulation Reference
ERG Unknown 24435928
JUND Activation 23482931
Gene ontology (GO) Gene Ontology (GO) annotations describing the biological processes, molecular functions, and cellular components associated with a gene.
98
GO ID Ontology Definition Evidence Reference
GO:0000122 Process Negative regulation of transcription by RNA polymerase II IBA
GO:0000122 Process Negative regulation of transcription by RNA polymerase II IDA 26291311
GO:0000785 Component Chromatin ISA
GO:0000976 Function Transcription cis-regulatory region binding IEA
GO:0000976 Function Transcription cis-regulatory region binding ISS
Other IDs Other IDs provides unique identifiers for this gene in OMIM, HGNC, and Ensembl databases.
MIM HGNC e!Ensembl
184430 11200 ENSG00000124766
Protein Protein information from UniProt database.
UniProt ID Unique identifier for the protein in the UniProt database. Click to view detailed protein information.
Q06945
Protein name Transcription factor SOX-4
Protein function Transcriptional activator that binds with high affinity to the T-cell enhancer motif 5'-AACAAAG-3' motif (PubMed:30661772). Required for IL17A-producing Vgamma2-positive gamma-delta T-cell maturation and development, via binding to regulator loc
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF00505 HMG_box 59 127 HMG (high mobility group) box Domain
Tissue specificity TISSUE SPECIFICITY: Testis, brain, and heart. {ECO:0000269|PubMed:8268656}.
Sequence
MVQQTNNAENTEALLAGESSDSGAGLELGIASSPTPGSTASTGGKADDPSWCKTPSGHIK
RPMNAFMVWSQIERRKIMEQSPDMHNAEISKRLGKRWKLLKDSDKIPFIREAERLRLKHM
ADYPDYK
YRPRKKVKSGNANSSSSAAASSKPGEKGDKVGGSGGGGHGGGGGGGSSNAGGG
GGGASGGGANSKPAQKKSCGSKVAGGAGGGVSKPHAKLILAGGGGGGKAAAAAAASFAAE
QAGAAALLPLGAAADHHSLYKARTPSASASASSAASASAALAAPGKHLAEKKVKRVYLFG
GLGTSSSPVGGVGAGADPSDPLGLYEEEGAGCSPDAPSLSGRSSAASSPAAGRSPADHRG
YASLRAASPAPSSAPSHASSSASSHSSSSSSSGSSSSDDEFEDDLLDLNPSSNFESMSLG
SFSSSSALDRDLDFNFEPGSGSHFEFPDYCTPEVSEMISGDWLESSISNLVFTY
Sequence length 474
Interactions View interactions
Pathways Pathway information has different metabolic/signaling pathways associated with genes.
  KEGG  Reactome
  MicroRNAs in cancer   Deactivation of the beta-catenin transactivating complex
Associated diseases Disease associations from ClinVar (causal & non-causal) and other databases (OMIM, Orphanet, GWAS, etc.).
22
Evidence Score: ★☆☆☆☆  Gene-disease association found in Text Mining only ★★☆☆☆  Found in Text Mining and Unknown/Other Associations ★★★☆☆  Reported in Unknown/Other Associations across ≥2 Sources ★★★★☆  ClinVar: Pathogenic/Likely Pathogenic (<5 Variants) ★★★★★  ClinVar: Pathogenic/Likely Pathogenic (≥5 Variants)
Causal Diseases associated with Pathogenic or Likely Pathogenic variants in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession Evidence Score
Coffin-Siris syndrome 10 Likely pathogenic; Pathogenic rs2113558441, rs1305965061, rs2532642024, rs2532639089, rs2532639300, rs2532639524, rs2532639544, rs2532639461, rs1334099693, rs1464282327, rs1582601669, rs1582601747 RCV001598703
RCV002246736
RCV003152402
RCV003152403
RCV003329106
View all (7 more)
★★★★★
ClinVar: Pathogenic / Likely Pathogenic (≥5 Variants)
Developmental delay Likely pathogenic; Pathogenic rs1334099693, rs1464282327, rs1582601669, rs1582601747 RCV001261716
RCV001261717
RCV001261718
RCV001261719
★★★★☆
ClinVar: Pathogenic / Likely Pathogenic (<5 Variants)
Intellectual disability Likely pathogenic; Pathogenic rs1334099693, rs1464282327, rs1582601669, rs1582601747 RCV001261716
RCV001261717
RCV001261718
RCV001261719
★★★★☆
ClinVar: Pathogenic / Likely Pathogenic (<5 Variants)
Mild facial and digital morphological abnormalities Likely pathogenic; Pathogenic rs1334099693, rs1464282327, rs1582601669, rs1582601747 RCV001261716
RCV001261717
RCV001261718
RCV001261719
★★★★☆
ClinVar: Pathogenic / Likely Pathogenic (<5 Variants)
Unknown / Other Associations ClinVar entries with uncertain/conflicting evidence, and associations from other databases (OMIM, Orphanet, GWAS, etc.) where the gene is not established as causal.
Phenotype Name Clinical Significance Source Evidence Score
ATRIAL FIBRILLATION Disgenet, GenCC
Disgenet, GenCC
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
CARCINOMA, ADENOID CYSTIC CTD
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
CARCINOMA, BASAL CELL CTD
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
CARDIOMEGALY CTD
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
Associations from Text Mining Disease associations identified through text mining
Disease Name Disease (Merged) Source PMID Relationship Type Evidence Score
Acute leukemia Leukemia BEFREE 28724437
★☆☆☆☆
Found in Text Mining only
Acute lymphocytic leukemia Lymphocytic Leukemia BEFREE 23152540, 24997151
★☆☆☆☆
Found in Text Mining only
Adenocarcinoma Adenocarcinoma BEFREE 14978140, 15665292, 25567207, 25644061
★☆☆☆☆
Found in Text Mining only
Adenocarcinoma Of Esophagus Esophageal Cancer BEFREE 25644061
★☆☆☆☆
Found in Text Mining only
Adenocarcinoma of lung (disorder) Lung adenocarcinoma BEFREE 29131257, 30840278
★☆☆☆☆
Found in Text Mining only
Adenoid Cystic Carcinoma Adenocarcinoma LHGDN 16636670
★☆☆☆☆
Found in Text Mining only
Adenoid Cystic Carcinoma Adenocarcinoma CTD_human_DG 16762588
★☆☆☆☆
Found in Text Mining only
Adult Medulloblastoma Medulloblastoma BEFREE 12125983, 15064731, 18577562
★☆☆☆☆
Found in Text Mining only
Adult T-Cell Lymphoma/Leukemia T-Cell Lymphoma/Leukemia BEFREE 23482931
★☆☆☆☆
Found in Text Mining only
Agenesis of corpus callosum Agenesis Of Corpus Callosum BEFREE 12368205
★☆☆☆☆
Found in Text Mining only