Gene
Entrez ID Entrez Gene ID - the GENE ID in NCBI Gene database.
29894
Gene name Gene Name - the full gene name approved by the HGNC.
Cleavage and polyadenylation specific factor 1
Gene symbol Gene Symbol - the official gene symbol approved by the HGNC.
CPSF1
Synonyms (NCBI Gene) Gene synonyms aliases
CPSF160, HSU37012, MYP27, P/cl.18
Disease Acronyms (UniProt) Disease acronyms from UniProt database
MYP27
Chromosome Chromosome number
8
Chromosome location Chromosomal Location - indicates the cytogenetic location of the gene or region on the chromosome.
8q24.3
Summary Summary of gene provided in NCBI Entrez Gene.
Cleavage and polyadenylation specificity factor (CPSF) is a multisubunit complex that plays a central role in 3-prime processing of pre-mRNAs. CPSF recognizes the AAUAAA signal in the pre-mRNA and interacts with other proteins to facilitate both RNA cleav
SNPs SNP information provided by dbSNP.
SNP ID Visualize variation Clinical significance Consequence
rs782640869 CT>- Pathogenic Frameshift variant, coding sequence variant
rs1586620121 G>A Pathogenic Stop gained, coding sequence variant
miRNA miRNA information provided by mirtarbase database.
miRTarBase ID miRNA Experiments Reference
MIRT004179 hsa-miR-197-3p Microarray 16822819
MIRT023819 hsa-miR-1-3p Proteomics 18668040
MIRT028569 hsa-miR-30a-5p Proteomics 18668040
MIRT032424 hsa-let-7b-5p Proteomics 18668040
MIRT051016 hsa-miR-17-5p CLASH 23622248
Gene ontology (GO) Gene ontology information of associated ontologies with gene provided by GO database.
GO ID Ontology Definition Evidence Reference
GO:0000398 Process MRNA splicing, via spliceosome TAS
GO:0005515 Function Protein binding IPI 7590244, 21102410, 21822216, 25416956, 32296183
GO:0005634 Component Nucleus IBA 21873635
GO:0005654 Component Nucleoplasm IDA
GO:0005654 Component Nucleoplasm TAS
Other IDs Other ids provides unique ids of gene in databases such as OMIM, HGNC, ENSEMBLE.
MIM HGNC e!Ensembl
606027 2324 ENSG00000071894
Protein
UniProt ID Q10570
Protein name Cleavage and polyadenylation specificity factor subunit 1 (Cleavage and polyadenylation specificity factor 160 kDa subunit) (CPSF 160 kDa subunit)
Protein function Component of the cleavage and polyadenylation specificity factor (CPSF) complex that plays a key role in pre-mRNA 3'-end formation, recognizing the AAUAAA signal sequence and interacting with poly(A) polymerase and other factors to bring about c
PDB 6BLY , 6BM0 , 6DNH , 6F9N , 6FUW , 6URG , 6URO , 8E3I , 8E3Q , 8R8R
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF10433 MMS1_N 92 686 Domain
PF03178 CPSF_A 1073 1409 CPSF A subunit region Family
Tissue specificity TISSUE SPECIFICITY: Widely expressed, with high expression in the retina. {ECO:0000269|PubMed:30689892}.
Sequence
MYAVYKQAHPPTGLEFSMYCNFFNNSERNLVVAGTSQLYVYRLNRDAEALTKNDRSTEGK
AHREKLELAASFSFFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSL
HYFEEPELRDGFVQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEG
QRSSFLPSYIIDVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSI
VAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVAL
NSLTTGTTAFPLRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAF
HFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSK
KKRVDATAGWSAAGKSVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSILNIGPCANAAV
GEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIA
PVRKEEEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTGQEIMELDTSGF
ATQGPTVFAGNIGDNRYIVQVSPLGIRLLEGVNQLHFIPVDLGAPIVQCAVADPYVVIMS
AEGHVTMFLLKSDSYGGRHHRLALHK
PPLHHQSKVITLCLYRDLSGMFTTESRLGGARDE
LGGRSGPEAEGLGSETSPTVDDEEEMLYGDSGSLFSPSKEEARRSSQPPADRDPAPFRAE
PTHWCLLVRENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEARREEATRQ
GELPLVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNIN
FREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGA
LRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRC
TAHYVAYHVESKVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLIS
PVSWEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRIL
IMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASEL
TGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDF
MVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGAT
EGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRA
FRMLHVDRRTLQNAVRNVLDGELLNRYLY
LSTMERSELAKKIGTTPDIILDDLLETDRVT
AHF
Sequence length 1443
Interactions View interactions
Pathways Pathway information has different metabolic/signaling pathways associated with genes.
  KEGG   Reactome
  mRNA surveillance pathway   Transport of Mature mRNA Derived from an Intronless Transcript
tRNA processing in the nucleus
mRNA Splicing - Major Pathway
mRNA 3'-end processing
RNA Polymerase II Transcription Termination
Processing of Intronless Pre-mRNAs
Associated diseases Disease information provided by ClinVar, GenCC, and GWAS databases.
Causal
Disease term Disease name dbSNP ID References
Gastric cancer Hereditary Diffuse Gastric Cancer rs137854571, rs63751108, rs34612342, rs121908383, rs121909144, rs121909775, rs121909219, rs121909223, rs63750871, rs80359530, rs121964873, rs121913530, rs606231203, rs121918505, rs587776802
View all (244 more)
21364753
Unknown
Disease term Disease name Evidence References Source
Myopia myopia 27 GenCC
Associations from Text Mining
Disease Name Relationship Type References
Alternating hemiplegia of childhood Associate 32929364
Breast Neoplasms Associate 32929364
Carcinogenesis Associate 32929364
Lung Neoplasms Associate 28634396
Myopia Associate 35002215, 37191617
Neoplasm Metastasis Associate 32929364
Neoplasms Associate 28634396
Prostatic Neoplasms Castration Resistant Associate 28928128
Retinal Diseases Associate 37191617
Squamous Cell Carcinoma of Head and Neck Associate 32437477