Gene Gene information from NCBI Gene database.
Entrez ID 10664
Gene name CCCTC-binding factor
Gene symbol CTCF
Synonyms (NCBI Gene)
CFAP108FAP108MRD21
Chromosome 16
Chromosome location 16q22.1
Summary This gene is a member of the BORIS + CTCF gene family and encodes a transcriptional regulator protein with 11 highly conserved zinc finger (ZF) domains. This nuclear protein is able to use different combinations of the ZF domains to bind different DNA tar
SNPs SNP information provided by dbSNP.
21
SNP ID Visualize variation Clinical significance Consequence
rs200677445 C>A,T Pathogenic Stop gained, missense variant, coding sequence variant
rs879255516 C>T Pathogenic Missense variant, coding sequence variant
rs879255570 ->T Pathogenic Coding sequence variant, frameshift variant, intron variant
rs879255571 ->A Pathogenic Frameshift variant, coding sequence variant
rs886039600 A>G Likely-pathogenic Missense variant, coding sequence variant
miRNA miRNA information provided by mirtarbase database.
325
miRTarBase ID miRNA Experiments Reference
MIRT024342 hsa-miR-215-5p Microarray 19074876
MIRT026312 hsa-miR-192-5p Microarray 19074876
MIRT030430 hsa-miR-24-3p Microarray 19748357
MIRT052311 hsa-let-7b-5p CLASH 23622248
MIRT040249 hsa-miR-615-3p CLASH 23622248
Transcription factors Transcription factors information provided by TRRUST V2 database.
5
Transcription factor Regulation Reference
BCL3 Repression 21912613
FOXH1 Unknown 19956589
NFKB1 Repression 21912613
WT1 Unknown 24534946
YY1 Activation 9756895
Gene ontology (GO) Gene Ontology (GO) annotations describing the biological processes, molecular functions, and cellular components associated with a gene.
66
GO ID Ontology Definition Evidence Reference
GO:0000122 Process Negative regulation of transcription by RNA polymerase II IDA 8649389
GO:0000122 Process Negative regulation of transcription by RNA polymerase II IEA
GO:0000122 Process Negative regulation of transcription by RNA polymerase II IMP 34856126
GO:0000775 Component Chromosome, centromeric region IDA 18550811, 26321640
GO:0000775 Component Chromosome, centromeric region IEA
Other IDs Other IDs provides unique identifiers for this gene in OMIM, HGNC, and Ensembl databases.
MIM HGNC e!Ensembl
604167 13723 ENSG00000102974
Protein Protein information from UniProt database.
UniProt ID Unique identifier for the protein in the UniProt database. Click to view detailed protein information.
P49711
Protein name Transcriptional repressor CTCF (11-zinc finger protein) (CCCTC-binding factor) (CTCFL paralog)
Protein function Chromatin binding factor that binds to DNA sequence specific sites and regulates the 3D structure of chromatin (PubMed:18347100, PubMed:18654629, PubMed:19322193). Binds together strands of DNA, thus forming chromatin loops, and anchors DNA to c
PDB 1X6H , 2CT1 , 5K5H , 5K5I , 5K5J , 5K5L , 5KKQ , 5T00 , 5T0U , 5UND , 5YEF , 5YEG , 5YEH , 5YEL , 6QNX , 7W1M , 8SSQ , 8SSR , 8SSS , 8SST , 8SSU
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF00096 zf-C2H2 266 288 Zinc finger, C2H2 type Domain
PF00096 zf-C2H2 294 316 Zinc finger, C2H2 type Domain
PF00096 zf-C2H2 322 345 Zinc finger, C2H2 type Domain
PF13909 zf-H2C2_5 351 375 Domain
PF00096 zf-C2H2 379 401 Zinc finger, C2H2 type Domain
PF00096 zf-C2H2 437 460 Zinc finger, C2H2 type Domain
PF00096 zf-C2H2 555 575 Zinc finger, C2H2 type Domain
Tissue specificity TISSUE SPECIFICITY: Ubiquitous. Absent in primary spermatocytes. {ECO:0000269|PubMed:9591631}.
Sequence
MEGDAVEAIVEESETFIKGKERKTYQRRREGGQEEDACHLPQNQTDGGEVVQDVNSSVQM
VMMEQLDPTLLQMKTEVMEGTVAPEAEAAVDDTQIITLQVVNMEEQPINIGELQLVQVPV
PVTVPVATTSVEELQGAYENEVSKEGLAESEPMICHTLPLPEGFQVVKVGANGEVETLEQ
GELPPQEDPSWQKDPDYQPPAKKTKKTKKSKLRYTEEGKDVDVSVYDFEEEQQEGLLSEV
NAEKVVGNMKPPKPTKIKKKGVKKTFQCELCSYTCPRRSNLDRHMKSHTDERPHKCHLCG
RAFRTVTLLRNHLNTH
TGTRPHKCPDCDMAFVTSGELVRHRRYKHTHEKPFKCSMCDYAS
VEVSKLKRHIRSHTG
ERPFQCSLCSYASRDTYKLKRHMRTHSGEKPYECYICHARFTQSG
TMKMHILQKHTENVAKFHCPHCDTVIARKSDLGVHLRKQHSYIEQGKKCRYCDAVFHERY
ALIQHQKSHKNEKRFKCDQCDYACRQERHMIMHKRTHTGEKPYACSHCDKTFRQKQLLDM
HFKRYHDPNFVPAAFVCSKCGKTFTRRNTMARHADNCAGPDGVEGENGGETKKSKRGRKR
KMRSKKEDSSDSENAEPDLDDNEDEEEPAVEIEPEPEPQPVTPAPPPAKKRRGRPPGRTN
QPKQNQPTAIIQVEDQNTGAIENIIVEVKKEPDAEPAEGEEEEAQPAATDAPNGDLTPEM
ILSMMDR
Sequence length 727
Interactions View interactions
Associated diseases Disease associations from ClinVar categorized as Causal (Pathogenic/Likely Pathogenic) or Unknown.
143
Causal Diseases associated with Pathogenic or Likely Pathogenic variants in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession
Acute megakaryoblastic leukemia in down syndrome Likely pathogenic rs2052291685, rs2052320698 RCV001293758
RCV001293760
Alveolar rhabdomyosarcoma Pathogenic rs886041997 RCV006253936
CTCF-related disorder Pathogenic; Likely pathogenic rs2142849782, rs776804939, rs1131691283, rs2052136913 RCV004738344
RCV004536741
RCV004535549
RCV001249458
CTCF-related neurodevelopmental disorder Likely pathogenic; Pathogenic rs2142827936, rs2142847350, rs2142849782, rs1002125753, rs2142826609, rs2142823841, rs2142826656, rs2142828055, rs2142887156, rs2052136913, rs2142839453, rs1555535156, rs1259610303, rs2142847326, rs2142847343
View all (24 more)
RCV001376022
RCV001375934
RCV001780408
RCV001780414
RCV002471133
RCV001785306
RCV001785307
RCV001785309
RCV001785310
RCV001785311
RCV001785312
RCV001785313
RCV001785314
RCV001785315
RCV001785316
RCV001785317
RCV001785318
RCV001785319
RCV001785320
RCV001785325
RCV001809164
RCV001843832
RCV002283836
RCV006259426
RCV003314152
RCV003315155
RCV003494053
RCV001253429
RCV001783119
RCV001784177
RCV000681644
RCV000767531
RCV000989614
RCV001002552
RCV000074333
RCV000074334
RCV000074335
RCV001254150
RCV001265578
RCV001293705
Unknown Diseases with uncertain, conflicting, or no pathogenic evidence in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession
Acute myeloid leukemia Benign rs143837268 RCV005888032
Cervical cancer Benign; Uncertain significance rs143837268, rs749116676 RCV005888035
RCV005899860
Cholangiocarcinoma Benign rs143837268 RCV005888045
Colon adenocarcinoma Benign rs143837268 RCV005888031
Associations from Text MiningDisease associations identified through Pubtator
Disease Name Relationship Type References
Alzheimer Disease Associate 31262699, 31500627, 37302762
Amyotrophic Lateral Sclerosis Associate 32732921
Anodontia Associate 35175239
Arthritis Juvenile Associate 33597588
Arthritis Rheumatoid Associate 35085847
Asthenozoospermia Associate 19584898
Asthma Associate 27848966, 29228930
Atherosclerosis Associate 19322193
Autoimmune Diseases Associate 32879140, 34279042
Azoospermia Associate 37334314