Gene Gene information from NCBI Gene database.
Entrez ID 57642
Gene name Collagen type XX alpha 1 chain
Gene symbol COL20A1
Synonyms (NCBI Gene)
-
Chromosome 20
Chromosome location 20q13.33
Gene ontology (GO) Gene Ontology (GO) annotations describing the biological processes, molecular functions, and cellular components associated with a gene.
7
GO ID Ontology Definition Evidence Reference
GO:0005515 Function Protein binding IPI 25416956
GO:0005576 Component Extracellular region IEA
GO:0005576 Component Extracellular region TAS
GO:0005581 Component Collagen trimer IEA
GO:0005788 Component Endoplasmic reticulum lumen IEA
Other IDs Other IDs provides unique identifiers for this gene in OMIM, HGNC, and Ensembl databases.
MIM HGNC e!Ensembl
619390 14670 ENSG00000101203
Protein Protein information from UniProt database.
UniProt ID Unique identifier for the protein in the UniProt database. Click to view detailed protein information.
Q9P218
Protein name Collagen alpha-1(XX) chain
Protein function Probable collagen protein.
PDB 2DKM , 2EE3 , 2EKJ , 5KF4
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF00092 VWA 179 349 von Willebrand factor type A domain Domain
PF00041 fn3 378 457 Fibronectin type III domain Domain
PF00041 fn3 468 547 Fibronectin type III domain Domain
PF00041 fn3 558 638 Fibronectin type III domain Domain
PF00041 fn3 648 729 Fibronectin type III domain Domain
PF00041 fn3 742 822 Fibronectin type III domain Domain
PF01391 Collagen 1069 1144 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1126 1193 Collagen triple helix repeat (20 copies) Repeat
Tissue specificity TISSUE SPECIFICITY: High expression in heart, lung, liver, skeletal muscle, kidney, pancreas, spleen, testis, ovary, subthalamic nucleus and fetal liver. Weak expression in other tissues tested.
Sequence
MSSGDPAHLGLCLWLWLGATLGREQVQASGLLRLAVLPEDRLQMKWRESEGSGLGYLVQV
KPMAGDSEQEVILTTKTPKATVGGLSPSKGYTLQIFELTGSGRFLLARREFVIEDLKSSS
LDRSSQRPLGSGAPEPTPSHTGSPDPEQASEPQVAFTPSQDPRTPAGPQFRCLPPVPADM
VFLVDGSWSIGHSHFQQVKDFLASVIAPFEIGPDKVQVGLTQYSGDAQTEWDLNSLSTKE
QVLAAVRRLRYKGGNTFTGLALTHVLGQNLQPAAGLRPEAAKVVILVTDGKSQDDVHTAA
RVLKDLGVNVFAVGVKNADEAELRLLASPPRDITVHSVLDFLQLGALAG
LLSRLICQRLQ
GGSPRQGPAAAPALDTLPAPTSLVLSQVTSSSIRLSWTPAPRHPLKYLIVWRASRGGTPR
EVVVEGPAASTELHNLASRTEYLVSVFPIYEGGVGEG
LRGLVTTAPLPPPRALTLAAVTP
RTVHLTWQPSAGATHYLVRCSPASPKGEEEEREVQVGRPEVLLDGLEPGRDYEVSVQSLR
GPEGSEA
RGIRARTPTLAPPRHLGFSDVSHDAARVFWEGAPRPVRLVRVTYVSSEGGHSG
QTEAPGNATSATLGPLSSSTTYTVRVTCLYPGGGSSTL
TGRVTTKKAPSPSQLSMTELPG
DAVQLAWVAAAPSGVLVYQITWTPLGEGKAHEISVPGNLGTAVLPGLGRHTEYDVTILAY
YRDGARSDP
VSLRYTPSTVSRSPPSNLALASETPDSLQVSWTPPLGRVLHYWLTYAPASG
LGPEKSVSVPGARSHVTLPDLQAATKYRVLVSAIYAAGRSEA
VSATGQTACPALRPDGSL
PGFDLMVAFSLVEKAYASIRGVAMEPSAFGGTPTFTLFKDAQLTRRVSDVYPAPLPPEHT
IVFLVRLLPETPREAFALWQMTAEDFQPLLGVLLDAGKKSLTYFHRDPRAALQEATFDPQ
EVRKIFFGSFHKVHVAVGRSKVRLYVDCRKVAERPLGEMGSPPAAGFVTLGRLAKARGPR
SSSAAFQLQMLQIVCSDTWADEDRCCELPASRDGETCPAFVSACSCSSETPGPPGPQGPP
GLPGRNGTPGEQGFPGPRGPPGVKGEKGDHGLPGLQGHPGHQGIP
GRVGLQGPKGMRGLE
GTAG
LPGPPGPRGFQGMAGARGTSGERGPPGTVGPTGLPGPKGERGEKGEPQS
LATLYQL
VSQASHVSKFDSFHENTRPPMPILEQKLEPGTEPLGSPGTRSKALVPGEWGRGGRHLEGR
GEPGAVGQMGSPGQQGASTQGLWE
Sequence length 1284
Interactions View interactions
Pathways Pathway information has different metabolic/signaling pathways associated with genes.
  KEGG  Reactome
  Protein digestion and absorption   Collagen biosynthesis and modifying enzymes
Collagen chain trimerization
Associated diseases Disease associations from ClinVar (causal & non-causal) and other databases (OMIM, Orphanet, GWAS, etc.).
3
Evidence Score: ★☆☆☆☆  Gene-disease association found in Text Mining only ★★☆☆☆  Found in Text Mining and Unknown/Other Associations ★★★☆☆  Reported in Unknown/Other Associations across ≥2 Sources ★★★★☆  ClinVar: Pathogenic/Likely Pathogenic (<5 Variants) ★★★★★  ClinVar: Pathogenic/Likely Pathogenic (≥5 Variants)
Unknown / Other Associations ClinVar entries with uncertain/conflicting evidence, and associations from other databases (OMIM, Orphanet, GWAS, etc.) where the gene is not established as causal.
Phenotype Name Clinical Significance Source Evidence Score
Acute myeloid leukemia Benign ClinVar
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
KERATOSIS PALMOPLANTARIS STRIATA 1 Disgenet
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
Palmoplantar keratoderma i, striate, focal, or diffuse Uncertain significance ClinVar
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
Associations from Text MiningDisease associations identified through Pubtator
Disease Name Relationship Type References Evidence Score
Breast Neoplasms Associate 24098497
★☆☆☆☆
Found in Text Mining only
Inflammation Associate 32986378
★☆☆☆☆
Found in Text Mining only
Nasopharyngeal Carcinoma Associate 32986378
★☆☆☆☆
Found in Text Mining only
Nasopharyngitis Associate 32986378
★☆☆☆☆
Found in Text Mining only
Pneumococcal Infections Associate 28974446
★☆☆☆☆
Found in Text Mining only
Rheumatic Diseases Associate 35157519
★☆☆☆☆
Found in Text Mining only