Gene
Entrez ID Entrez Gene ID - the GENE ID in NCBI Gene database.
1063
Gene name Gene Name - the full gene name approved by the HGNC.
Centromere protein F
Gene symbol Gene Symbol - the official gene symbol approved by the HGNC.
CENPF
Synonyms (NCBI Gene) Gene synonyms aliases
CENF, CILD31, PRO1779, STROMS, hcp-1
Disease Acronyms (UniProt) Disease acronyms from UniProt database
STROMS
Chromosome Chromosome number
1
Chromosome location Chromosomal Location - indicates the cytogenetic location of the gene or region on the chromosome.
1q41
Summary Summary of gene provided in NCBI Entrez Gene.
This gene encodes a protein that associates with the centromere-kinetochore complex. The protein is a component of the nuclear matrix during the G2 phase of interphase. In late G2 the protein associates with the kinetochore and maintains this association
SNPs SNP information provided by dbSNP.
SNP ID Visualize variation Clinical significance Consequence
rs141352776 G>A,T Likely-pathogenic Coding sequence variant, stop gained, missense variant
rs142993088 A>C Conflicting-interpretations-of-pathogenicity Coding sequence variant, missense variant
rs144237457 A>G Conflicting-interpretations-of-pathogenicity, uncertain-significance Coding sequence variant, missense variant
rs200976140 G>T Pathogenic Coding sequence variant, stop gained
rs367624766 G>A,T Pathogenic Missense variant, coding sequence variant, stop gained
miRNA miRNA information provided by mirtarbase database.
miRTarBase ID miRNA Experiments Reference
MIRT002525 hsa-miR-373-3p Microarray 15685193
MIRT019840 hsa-miR-375 Microarray 20215506
MIRT002525 hsa-miR-373-3p Microarray;Other 15685193
MIRT023287 hsa-miR-122-5p Proteomics 21750653
MIRT023780 hsa-miR-1-3p Proteomics 18668040
Gene ontology (GO) Gene ontology information of associated ontologies with gene provided by GO database.
GO ID Ontology Definition Evidence Reference
GO:0000278 Process Mitotic cell cycle IBA 21873635
GO:0000278 Process Mitotic cell cycle IMP 7542657
GO:0000775 Component Chromosome, centromeric region IBA 21873635
GO:0000775 Component Chromosome, centromeric region IDA 7542657, 7651420, 11084331
GO:0000776 Component Kinetochore IDA 7542657, 9763420, 17363900
Other IDs Other ids provides unique ids of gene in databases such as OMIM, HGNC, ENSEMBLE.
MIM HGNC e!Ensembl
600236 1857 ENSG00000117724
Protein
UniProt ID P49454
Protein name Centromere protein F (CENP-F) (AH antigen) (Kinetochore protein CENPF) (Mitosin)
Protein function Required for kinetochore function and chromosome segregation in mitosis. Required for kinetochore localization of dynein, LIS1, NDE1 and NDEL1. Regulates recycling of the plasma membrane by acting as a link between recycling vesicles and the mic
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF10481 CENP-F_N 1 307 Cenp-F N-terminal domain Coiled-coil
PF10473 CENP-F_leu_zip 1893 2035 Leucine-rich repeats of kinetochore protein Cenp-F/LEK1 Coiled-coil
PF10473 CENP-F_leu_zip 2131 2270 Leucine-rich repeats of kinetochore protein Cenp-F/LEK1 Coiled-coil
PF10473 CENP-F_leu_zip 2313 2452 Leucine-rich repeats of kinetochore protein Cenp-F/LEK1 Coiled-coil
PF10490 CENP-F_C_Rb_bdg 2967 3013 Rb-binding domain of kinetochore protein Cenp-F/LEK1 Domain
Sequence
MSWALEEWKEGLPTRALQKIQELEGQLDKLKKEKQQRQFQLDSLEAALQKQKQKVENEKT
EGTNLKRENQRLMEICESLEKTKQKISHELQVKESQVNFQEGQLNSGKKQIEKLEQELKR
CKSELERSQQAAQSADVSLNPCNTPQKIFTTPLTPSQYYSGSKYEDLKEKYNKEVEERKR
LEAEVKALQAKKASQTLPQATMNHRDIARHQASSSVFSWQQEKTPSHLSSNSQRTPIRRD
FSASYFSGEQEVTPSRSTLQIGKRDANSSFFDNSSSPHLLDQLKAQNQELRNKINELELR
LQGHEKE
MKGQVNKFQELQLQLEKAKVELIEKEKVLNKCRDELVRTTAQYDQASTKYTAL
EQKLKKLTEDLSCQRQNAESARCSLEQKIKEKEKEFQEELSRQQRSFQTLDQECIQMKAR
LTQELQQAKNMHNVLQAELDKLTSVKQQLENNLEEFKQKLCRAEQAFQASQIKENELRRS
MEEMKKENNLLKSHSEQKAREVCHLEAELKNIKQCLNQSQNFAEEMKAKNTSQETMLRDL
QEKINQQENSLTLEKLKLAVADLEKQRDCSQDLLKKREHHIEQLNDKLSKTEKESKALLS
ALELKKKEYEELKEEKTLFSCWKSENEKLLTQMESEKENLQSKINHLETCLKTQQIKSHE
YNERVRTLEMDRENLSVEIRNLHNVLDSKSVEVETQKLAYMELQQKAEFSDQKHQKEIEN
MCLKTSQLTGQVEDLEHKLQLLSNEIMDKDRCYQDLHAEYESLRDLLKSKDASLVTNEDH
QRSLLAFDQQPAMHHSFANIIGEQGSMPSERSECRLEADQSPKNSAILQNRVDSLEFSLE
SQKQMNSDLQKQCEELVQIKGEIEENLMKAEQMHQSFVAETSQRISKLQEDTSAHQNVVA
ETLSALENKEKELQLLNDKVETEQAEIQELKKSNHLLEDSLKELQLLSETLSLEKKEMSS
IISLNKREIEELTQENGTLKEINASLNQEKMNLIQKSESFANYIDEREKSISELSDQYKQ
EKLILLQRCEETGNAYEDLSQKYKAAQEKNSKLECLLNECTSLCENRKNELEQLKEAFAK
EHQEFLTKLAFAEERNQNLMLELETVQQALRSEMTDNQNNSKSEAGGLKQEIMTLKEEQN
KMQKEVNDLLQENEQLMKVMKTKHECQNLESEPIRNSVKERESERNQCNFKPQMDLEVKE
ISLDSYNAQLVQLEAMLRNKELKLQESEKEKECLQHELQTIRGDLETSNLQDMQSQEISG
LKDCEIDAEEKYISGPHELSTSQNDNAHLQCSLQTTMNKLNELEKICEILQAEKYELVTE
LNDSRSECITATRKMAEEVGKLLNEVKILNDDSGLLHGELVEDIPGGEFGEQPNEQHPVS
LAPLDESNSYEHLTLSDKEVQMHFAELQEKFLSLQSEHKILHDQHCQMSSKMSELQTYVD
SLKAENLVLSTNLRNFQGDLVKEMQLGLEEGLVPSLSSSCVPDSSSLSSLGDSSFYRALL
EQTGDMSLLSNLEGAVSANQCSVDEVFCSSLQEENLTRKETPSAPAKGVEELESLCEVYR
QSLEKLEEKMESQGIMKNKEIQELEQLLSSERQELDCLRKQYLSENEQWQQKLTSVTLEM
ESKLAAEKKQTEQLSLELEVARLQLQGLDLSSRSLLGIDTEDAIQGRNESCDISKEHTSE
TTERTPKHDVHQICDKDAQQDLNLDIEKITETGAVKPTGECSGEQSPDTNYEPPGEDKTQ
GSSECISELSFSGPNALVPMDFLGNQEDIHNLQLRVKETSNENLRLLHVIEDRDRKVESL
LNEMKELDSKLHLQEVQLMTKIEACIELEKIVGELKKENSDLSEKLEYFSCDHQELLQRV
ETSEGLNSDLEMHADKSSREDIGDNVAKVNDSWKERFLDVENELSRIRSEKASIEHEALY
LEADLEVVQTEKLCLEKDNENKQKVIVCLEEELSVVTSERNQLRGELDTMSKKTTALDQL
SEKMKEKTQELESHQSECLHCIQVAEAEVKEKTELLQTLSSDVSELLKDKTHLQE
KLQSL
EKDSQALSLTKCELENQIAQLNKEKELLVKESESLQARLSESDYEKLNVSKALEAALVEK
GEFALRLSSTQEEVHQLRRGIEKLRVRIEADEKKQLHIAEKLKERERENDSLKDKVENLE
RELQMSEENQELVILDAENSKAEVETLKTQIEEMARSLKVFELDLVTLRSEKENLTKQIQ
EKQGQLSELDKLLSSFKSLLEEKEQAEIQIKEESKTAVEMLQNQLKELNE
AVAALCGDQE
IMKATEQSLDPPIEEEHQLRNSIEKLRARLEADEKKQLCVLQQLKESEHHADLLKGRVEN
LERELEIARTNQEHAALEAENSKGEVETLKAKIEGMTQSLRGLELDVVTIRSEKENLTNE
LQKEQERISELEIINSSFENILQEKEQEKVQMKEKSSTAMEMLQTQLKELNE
RVAALHND
QEACKAKEQNLSSQVECLELEKAQLLQGLDEAKNNYIVLQSSVNGLIQEVEDGKQKLEKK
DEEISRLKNQIQDQEQLVSKLSQVEGEHQLWKEQNLELRNLTVELEQKIQVLQSKNASLQ
DTLEVLQSSYKNLENELELTKMDKMSFVEKVNKMTAKETELQREMHEMAQKTAELQEELS
GEKNRLAGELQLLLEEIKSSKDQLKELTLENSELKKSLDCMHKDQVEKEGKVREEIAEYQ
LRLHEAEKKHQALLLDTNKQYEVEIQTYREKLTSKEECLSSQKLEIDLLKSSKEELNNSL
KATTQILEELKKTKMDNLKYVNQLKKENERAQGKMKLLIKSCKQLEEEKEILQKELSQLQ
AAQEKQKTGTVMDTKVDELTTEIKELKETLEEKTKEADEYLDKYCSLLISHEKLEKAKEM
LETQVAHLCSQQSKQDSRGSPLLGPVVPGPSPIPSVTEKRLSSGQNKASGKRQRSSGIWE
NGRGPTPATPESFSKKSKKAVMSGIHPAEDTEGTEFEPEGLPEVVKKGFADIPTGKTSPY
ILRRTTMATRTSP
RLAAQKLALSPLSLGKENLAESSKPTAGGSRSQKVKVAQRSPVDSGT
ILREPTTKSVPVNNLPERSPTDSPREGLRVKRGRLVPSPKAGLESNGSENCKVQ
Sequence length 3114
Interactions View interactions
Pathways Pathway information has different metabolic/signaling pathways associated with genes.
  Reactome
    Amplification of signal from unattached kinetochores via a MAD2 inhibitory signal
Polo-like kinase mediated events
Separation of Sister Chromatids
Resolution of Sister Chromatid Cohesion
RHO GTPases Activate Formins
Mitotic Prometaphase
EML4 and NUDC in mitotic spindle formation
Associated diseases Disease information provided by ClinVar, GenCC, and GWAS databases.
Causal
Disease term Disease name dbSNP ID References
Agenesis of corpus callosum Agenesis of corpus callosum rs754914260, rs1057519053, rs1057519056, rs1057519054, rs1057519055, rs1057519057, rs1384496494, rs1599017933
Anterior segment dysgenesis Irido-corneo-trabecular dysgenesis (disorder) rs121907917, rs72549387, rs121909248, rs104893861, rs104893862, rs80358194, rs2113111009, rs104893957, rs104893958, rs104893954, rs587778873, rs587778874, rs878853070, rs752281590, rs369858688
View all (8 more)
Breast cancer Malignant neoplasm of breast rs587776547, rs1137887, rs137853007, rs587776650, rs80359351, rs80359714, rs121917783, rs104886456, rs121964878, rs80359874, rs80357868, rs80357508, rs387906843, rs80357569, rs80358158
View all (309 more)
17659439
Breast carcinoma Breast Carcinoma rs80359671, rs11540652, rs28934575, rs28897672, rs137886232, rs193922376, rs80357783, rs80359306, rs80359405, rs80359507, rs80359598, rs80358429, rs397507683, rs397515636, rs80359451
View all (71 more)
17659439
Unknown
Disease term Disease name Evidence References Source
Oligodendroglioma Oligodendroglioma GWAS
Associations from Text Mining
Disease Name Relationship Type References
Adenocarcinoma of Lung Associate 32672359, 33148251, 35354488, 36260870
Adenocarcinoma of Lung Stimulate 33398330
Adrenocortical Carcinoma Associate 35123514
Aneuploidy Associate 17205517
Astrocytoma Associate 12937144
Breast Neoplasms Associate 12115341, 17205517, 19102762, 20015195, 26868636, 32210727, 35456460, 36096037, 37592220, 39242688
Carcinogenesis Associate 40659783
Carcinoma Adenoid Cystic Associate 31914060
Carcinoma Hepatocellular Associate 26137588, 33157984, 33946043
Carcinoma Pancreatic Ductal Associate 15548371, 29483831