Gene Gene information from NCBI Gene database.
Entrez ID 29894
Gene name Cleavage and polyadenylation specific factor 1
Gene symbol CPSF1
Synonyms (NCBI Gene)
CPSF160, HSU37012, MYP27, P/cl.18
Chromosome 8
Chromosome location 8q24.3
Summary Cleavage and polyadenylation specificity factor (CPSF) is a multisubunit complex that plays a central role in 3-prime processing of pre-mRNAs. CPSF recognizes the AAUAAA signal in the pre-mRNA and interacts with other proteins to facilitate both RNA cleav
SNPs SNP information provided by dbSNP.
2
SNP ID Visualize variation Clinical significance Consequence
rs782640869 CT>- Pathogenic Frameshift variant, coding sequence variant
rs1586620121 G>A Pathogenic Stop gained, coding sequence variant
miRNA miRNA information provided by the miRTarBase database.
51
miRTarBase ID miRNA Experiments Reference
MIRT004179 hsa-miR-197-3p Microarray 16822819
MIRT023819 hsa-miR-1-3p Proteomics 18668040
MIRT028569 hsa-miR-30a-5p Proteomics 18668040
MIRT032424 hsa-let-7b-5p Proteomics 18668040
MIRT051016 hsa-miR-17-5p CLASH 23622248
Gene ontology (GO) Gene Ontology (GO) annotations describing the biological processes, molecular functions, and cellular components associated with a gene.
15
GO ID Ontology Definition Evidence Reference
GO:0003676 Function Nucleic acid binding IEA
GO:0003723 Function RNA binding IEA
GO:0005515 Function Protein binding IPI 7590244, 21102410, 21822216, 25416956, 32296183
GO:0005634 Component Nucleus IBA
GO:0005634 Component Nucleus IEA
Other IDs Unique identifiers for this gene in the OMIM, HGNC, and Ensembl databases.
MIM HGNC Ensembl
606027 2324 ENSG00000071894
Protein Protein information from UniProt database.
UniProt ID Unique identifier for the protein in the UniProt database.
Q10570
Protein name Cleavage and polyadenylation specificity factor subunit 1 (Cleavage and polyadenylation specificity factor 160 kDa subunit) (CPSF 160 kDa subunit)
Protein function Component of the cleavage and polyadenylation specificity factor (CPSF) complex that plays a key role in pre-mRNA 3'-end formation, recognizing the AAUAAA signal sequence and interacting with poly(A) polymerase and other factors to bring about c
PDB 6BLY, 6BM0, 6DNH, 6F9N, 6FUW, 6URG, 6URO, 8E3I, 8E3Q, 8R8R
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF10433 MMS1_N 92-686 Domain
PF03178 CPSF_A 1073-1409 CPSF A subunit region Family
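The two numbers in each Pfam row are the start and end residues of the domain. The following is a minimal sketch of how such spans can be turned into lengths and sequence slices, assuming the coordinates are 1-based and inclusive (the usual convention for Pfam and UniProt features). The names `domains`, `domain_length`, and `domain_slice` are illustrative and not part of any database API.

```python
# Pfam spans copied from the rows above: name -> (start, end).
# Assumption: coordinates are 1-based and inclusive.
domains = {
    "PF10433 MMS1_N": (92, 686),
    "PF03178 CPSF_A": (1073, 1409),
}


def domain_length(start: int, end: int) -> int:
    """Number of residues covered by a 1-based inclusive span."""
    return end - start + 1


def domain_slice(sequence: str, start: int, end: int) -> str:
    """Extract a 1-based inclusive span from a 0-based Python string."""
    return sequence[start - 1:end]


# Length of each annotated region under the stated assumption.
lengths = {name: domain_length(*span) for name, span in domains.items()}
```

Under that assumption, `domain_slice(seq, 92, 686)` would return the MMS1_N region when `seq` holds the full protein sequence as a single string.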
Tissue specificity Widely expressed, with high expression in the retina. {ECO:0000269|PubMed:30689892}.
Sequence
MYAVYKQAHPPTGLEFSMYCNFFNNSERNLVVAGTSQLYVYRLNRDAEALTKNDRSTEGK
AHREKLELAASFSFFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSL
HYFEEPELRDGFVQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEG
QRSSFLPSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSI
VAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVAL
NSLTTGTTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAF
HFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSK
KKRVDATAGWSAAGKSVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSILNIGPCANAAV
GEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIA
PVRKEEEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTSGF
ATQGPTVFAGNIGDNRYIVQVSPLGIRLLEGVNQLHFIPVDLGAPIVQCAVADPYVVIMS
AEGHVTMFLLKSDSYGGRHHRLALHKPPLHHQSKVITLCLYRDLSGMFTTESRLGGARDE
LGGRSGPEAEGLGSETSPTVDDEEEMLYGDSGSLFSPSKEEARRSSQPPADRDPAPFRAE
PTHWCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEARREEATRQ
GELPLVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNIN
FREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGA
LRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRC
TAHYVAYHVESKVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLIS
PVSWEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRIL
IMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASEL
TGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDF
MVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGAT
EGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRA
FRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRVT
AHF
Sequence length 1443
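The stated sequence length can be checked by joining the wrapped sequence lines and counting residues. The sketch below does this for the final 63 residues of the sequence shown above. `join_sequence` and `has_length` are hypothetical helper names introduced for illustration. For the full record, the same check would be run with an expected length of 1443.

```python
def join_sequence(lines):
    """Concatenate wrapped sequence lines, dropping all whitespace."""
    return "".join("".join(line.split()) for line in lines)


def has_length(sequence, expected):
    """Return True when the joined sequence has the expected residue count."""
    return len(sequence) == expected


# The last residues of the record, wrapped across several short lines.
tail_lines = [
    "FRMLHVDRRTLQNAVRNVLDGELLNRYLY",
    "LSTMERSELAKKIGTTPDIILDDLLETDRVT",
    "AHF",
]

tail = join_sequence(tail_lines)
tail_ok = has_length(tail, 63)
```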
Pathways Metabolic and signaling pathways associated with this gene.
KEGG
mRNA surveillance pathway
Reactome
Transport of Mature mRNA Derived from an Intronless Transcript
tRNA processing in the nucleus
mRNA Splicing - Major Pathway
mRNA 3'-end processing
RNA Polymerase II Transcription Termination
Processing of Intronless Pre-mRNAs
Associated diseases Disease associations from ClinVar categorized as Causal (Pathogenic/Likely Pathogenic) or Unknown.
15
Causal Diseases associated with Pathogenic or Likely Pathogenic variants in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession
Myopia 27 Likely pathogenic; Pathogenic rs1821166689, rs2537385296, rs782555528, rs1554862601, rs1586620121 RCV003991838, RCV004566414, RCV001029433, RCV001029435, RCV001029437
Unknown Diseases with uncertain, conflicting, or no pathogenic evidence in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession
CPSF1-related disorder Uncertain significance; Likely benign; Benign rs1554865719, rs140031041, rs1226042676, rs781945735, rs148200138 RCV003416932, RCV003909624, RCV003956951, RCV003946772, RCV003968920
Gastric cancer - rs141624286 RCV006009020
Associations from Text Mining Disease associations identified through PubTator.
Disease Name Relationship Type References
Alternating hemiplegia of childhood Associate 32929364
Breast Neoplasms Associate 32929364
Carcinogenesis Associate 32929364
Lung Neoplasms Associate 28634396
Myopia Associate 35002215, 37191617
Neoplasm Metastasis Associate 32929364
Neoplasms Associate 28634396
Prostatic Neoplasms Castration Resistant Associate 28928128
Retinal Diseases Associate 37191617
Squamous Cell Carcinoma of Head and Neck Associate 32437477