Gene Gene information from NCBI Gene database.
Entrez ID 10664
Gene name CCCTC-binding factor
Gene symbol CTCF
Synonyms (NCBI Gene)
CFAP108FAP108MRD21
Chromosome 16
Chromosome location 16q22.1
Summary This gene is a member of the BORIS + CTCF gene family and encodes a transcriptional regulator protein with 11 highly conserved zinc finger (ZF) domains. This nuclear protein is able to use different combinations of the ZF domains to bind different DNA tar
SNPs SNP information provided by dbSNP.
21
SNP ID Visualize variation Clinical significance Consequence
rs200677445 C>A,T Pathogenic Stop gained, missense variant, coding sequence variant
rs879255516 C>T Pathogenic Missense variant, coding sequence variant
rs879255570 ->T Pathogenic Coding sequence variant, frameshift variant, intron variant
rs879255571 ->A Pathogenic Frameshift variant, coding sequence variant
rs886039600 A>G Likely-pathogenic Missense variant, coding sequence variant
miRNA miRNA information provided by mirtarbase database.
325
miRTarBase ID miRNA Experiments Reference
MIRT024342 hsa-miR-215-5p Microarray 19074876
MIRT026312 hsa-miR-192-5p Microarray 19074876
MIRT030430 hsa-miR-24-3p Microarray 19748357
MIRT052311 hsa-let-7b-5p CLASH 23622248
MIRT040249 hsa-miR-615-3p CLASH 23622248
Transcription factors Transcription factors information provided by TRRUST V2 database.
5
Transcription factor Regulation Reference
BCL3 Repression 21912613
FOXH1 Unknown 19956589
NFKB1 Repression 21912613
WT1 Unknown 24534946
YY1 Activation 9756895
Gene ontology (GO) Gene Ontology (GO) annotations describing the biological processes, molecular functions, and cellular components associated with a gene.
66
GO ID Ontology Definition Evidence Reference
GO:0000122 Process Negative regulation of transcription by RNA polymerase II IDA 8649389
GO:0000122 Process Negative regulation of transcription by RNA polymerase II IEA
GO:0000122 Process Negative regulation of transcription by RNA polymerase II IMP 34856126
GO:0000775 Component Chromosome, centromeric region IDA 18550811, 26321640
GO:0000775 Component Chromosome, centromeric region IEA
Other IDs Other IDs provides unique identifiers for this gene in OMIM, HGNC, and Ensembl databases.
MIM HGNC e!Ensembl
604167 13723 ENSG00000102974
Protein Protein information from UniProt database.
UniProt ID Unique identifier for the protein in the UniProt database. Click to view detailed protein information.
P49711
Protein name Transcriptional repressor CTCF (11-zinc finger protein) (CCCTC-binding factor) (CTCFL paralog)
Protein function Chromatin binding factor that binds to DNA sequence specific sites and regulates the 3D structure of chromatin (PubMed:18347100, PubMed:18654629, PubMed:19322193). Binds together strands of DNA, thus forming chromatin loops, and anchors DNA to c
PDB 1X6H , 2CT1 , 5K5H , 5K5I , 5K5J , 5K5L , 5KKQ , 5T00 , 5T0U , 5UND , 5YEF , 5YEG , 5YEH , 5YEL , 6QNX , 7W1M , 8SSQ , 8SSR , 8SSS , 8SST , 8SSU
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF00096 zf-C2H2 266 288 Zinc finger, C2H2 type Domain
PF00096 zf-C2H2 294 316 Zinc finger, C2H2 type Domain
PF00096 zf-C2H2 322 345 Zinc finger, C2H2 type Domain
PF13909 zf-H2C2_5 351 375 Domain
PF00096 zf-C2H2 379 401 Zinc finger, C2H2 type Domain
PF00096 zf-C2H2 437 460 Zinc finger, C2H2 type Domain
PF00096 zf-C2H2 555 575 Zinc finger, C2H2 type Domain
Tissue specificity TISSUE SPECIFICITY: Ubiquitous. Absent in primary spermatocytes. {ECO:0000269|PubMed:9591631}.
Sequence
MEGDAVEAIVEESETFIKGKERKTYQRRREGGQEEDACHLPQNQTDGGEVVQDVNSSVQM
VMMEQLDPTLLQMKTEVMEGTVAPEAEAAVDDTQIITLQVVNMEEQPINIGELQLVQVPV
PVTVPVATTSVEELQGAYENEVSKEGLAESEPMICHTLPLPEGFQVVKVGANGEVETLEQ
GELPPQEDPSWQKDPDYQPPAKKTKKTKKSKLRYTEEGKDVDVSVYDFEEEQQEGLLSEV
NAEKVVGNMKPPKPTKIKKKGVKKTFQCELCSYTCPRRSNLDRHMKSHTDERPHKCHLCG
RAFRTVTLLRNHLNTH
TGTRPHKCPDCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYAS
VEVSKLKRHIRSHTG
ERPFQCSLCSYASRDTYKLKRHMRTHSGEKPYECYICHARFTQSG
TMKMHILQKHTENVAKFHCPHCDTVIARKSDLGVHLRKQHSYIEQGKKCRYCDAVFHERY
ALIQHQKSHKNEKRFKCDQCDYACRQERHMIMHKRTHTGEKPYACSHCDKTFRQKQLLDM
HFKRYHDPNFVPAAFVCSKCGKTFTRRNTMARHADNCAGPDGVEGENGGETKKSKRGRKR
KMRSKKEDSSDSENAEPDLDDNEDEEEPAVEIEPEPEPQPVTPAPPPAKKRRGRPPGRTN
QPKQNQPTAIIQVEDQNTGAIENIIVEVKKEPDAEPAEGEEEEAQPAATDAPNGDLTPEM
ILSMMDR
Sequence length 727
Interactions View interactions
Associated diseases Disease associations from ClinVar (causal & non-causal) and other databases (OMIM, Orphanet, GWAS, etc.).
32
Evidence Score: ★☆☆☆☆  Gene-disease association found in Text Mining only ★★☆☆☆  Found in Text Mining and Unknown/Other Associations ★★★☆☆  Reported in Unknown/Other Associations across ≥2 Sources ★★★★☆  ClinVar: Pathogenic/Likely Pathogenic (<5 Variants) ★★★★★  ClinVar: Pathogenic/Likely Pathogenic (≥5 Variants)
Causal Diseases associated with Pathogenic or Likely Pathogenic variants in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession Evidence Score
Acute megakaryoblastic leukemia in down syndrome Likely pathogenic rs2052291685, rs2052320698 RCV001293758
RCV001293760
★★★★☆
ClinVar: Pathogenic / Likely Pathogenic (<5 Variants)
Alveolar rhabdomyosarcoma Pathogenic rs886041997 RCV006253936
★★★★☆
ClinVar: Pathogenic / Likely Pathogenic (<5 Variants)
CTCF-related disorder Pathogenic; Likely pathogenic rs2142849782, rs776804939, rs1131691283, rs2052136913 RCV004738344
RCV004536741
RCV004535549
RCV001249458
★★★★☆
ClinVar: Pathogenic / Likely Pathogenic (<5 Variants)
CTCF-related neurodevelopmental disorder Likely pathogenic; Pathogenic rs2142827936, rs2142847350, rs2142849782, rs1002125753, rs2142826609, rs2142823841, rs2142826656, rs2142828055, rs2142887156, rs2052136913, rs2142839453, rs1555535156, rs1259610303, rs2142847326, rs2142847343
View all (24 more)
RCV001376022
RCV001375934
RCV001780408
RCV001780414
RCV002471133
View all (35 more)
★★★★★
ClinVar: Pathogenic / Likely Pathogenic (≥5 Variants)
Unknown / Other Associations ClinVar entries with uncertain/conflicting evidence, and associations from other databases (OMIM, Orphanet, GWAS, etc.) where the gene is not established as causal.
Phenotype Name Clinical Significance Source Evidence Score
Acute myeloid leukemia Benign ClinVar
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
ALZHEIMER DISEASE GWAS catalog
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
Cervical cancer Benign; Uncertain significance ClinVar
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
Cholangiocarcinoma Benign ClinVar
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
Associations from Text Mining Disease associations identified through text mining
Disease Name Disease (Merged) Source PMID Relationship Type Evidence Score
Acute lymphocytic leukemia Lymphocytic Leukemia BEFREE 21390130, 24393203, 30537513
★☆☆☆☆
Found in Text Mining only
Acute Promyelocytic Leukemia Promyelocytic Leukemia BEFREE 24493669
★☆☆☆☆
Found in Text Mining only
Adult Acute Lymphocytic Leukemia Lymphocytic Leukemia BEFREE 24393203, 30537513
★☆☆☆☆
Found in Text Mining only
Adult T-Cell Lymphoma/Leukemia T-Cell Lymphoma/Leukemia BEFREE 23698277
★☆☆☆☆
Found in Text Mining only
Alzheimer Disease Alzheimer disease Pubtator 31262699, 31500627, 37302762 Associate
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
Amyotrophic Lateral Sclerosis Amyotrophic lateral sclerosis Pubtator 32732921 Associate
★☆☆☆☆
Found in Text Mining only
Anodontia Anodontia Pubtator 35175239 Associate
★☆☆☆☆
Found in Text Mining only
Arteriosclerosis Arteriosclerosis BEFREE 31757932
★☆☆☆☆
Found in Text Mining only
Arthritis Juvenile Juvenile arthritis Pubtator 33597588 Associate
★☆☆☆☆
Found in Text Mining only
Arthritis Rheumatoid Rheumatoid arthritis Pubtator 35085847 Associate
★☆☆☆☆
Found in Text Mining only