Gene Gene information from NCBI Gene database.
Entrez ID 81578
Gene name Collagen type XXI alpha 1 chain
Gene symbol COL21A1
Synonyms (NCBI Gene)
COLA1LFP633
Chromosome 6
Chromosome location 6p12.1|6p12.3-p11.2
Summary This gene encodes the alpha chain of type XXI collagen, a member of the FACIT (fibril-associated collagens with interrupted helices) collagen family. Type XXI collagen is localized to tissues containing type I collagen and maintains the integrity of the e
miRNA miRNA information provided by mirtarbase database.
68
miRTarBase ID miRNA Experiments Reference
MIRT006383 hsa-miR-29c-3p Luciferase reporter assay 21436257
MIRT006383 hsa-miR-29c-3p Luciferase reporter assay 21436257
MIRT006383 hsa-miR-29c-3p Luciferase reporter assay 21436257
MIRT018264 hsa-miR-335-5p Microarray 18185580
MIRT902730 hsa-miR-1305 CLIP-seq
Gene ontology (GO) Gene Ontology (GO) annotations describing the biological processes, molecular functions, and cellular components associated with a gene.
11
GO ID Ontology Definition Evidence Reference
GO:0005576 Component Extracellular region IEA
GO:0005576 Component Extracellular region TAS
GO:0005581 Component Collagen trimer IEA
GO:0005737 Component Cytoplasm IEA
GO:0005788 Component Endoplasmic reticulum lumen IEA
Other IDs Other IDs provides unique identifiers for this gene in OMIM, HGNC, and Ensembl databases.
MIM HGNC e!Ensembl
610002 17025 ENSG00000124749
Protein Protein information from UniProt database.
UniProt ID Unique identifier for the protein in the UniProt database. Click to view detailed protein information.
Q96P44
Protein name Collagen alpha-1(XXI) chain
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF00092 VWA 37 206 von Willebrand factor type A domain Domain
PF01391 Collagen 446 501 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 482 558 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 527 597 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 671 743 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 730 790 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 824 884 Collagen triple helix repeat (20 copies) Repeat
Tissue specificity TISSUE SPECIFICITY: Highly expressed in lymph node, jejunum, pancreas, stomach, trachea, testis, uterus and placenta; moderately expressed in brain, colon, lung, prostate, spinal cord, salivary gland and vascular smooth-muscle cells and very weakly expres
Sequence
MAHYITFLCMVLVLLLQNSVLAEDGEVRSSCRTAPTDLVFILDGSYSVGPENFEIVKKWL
VNITKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTGKAI
QFALDYLFAKSSRFLTKIAVVLTDGKSQDDVKDAAQAARDSKITLFAIGVGSETEDAELR
AIANKPSSTYVFYVEDYIAISKIREV
MKQKLCEESVCPTRIPVAARDERGFDILLGLDVN
KKVKKRIQLSPKKIKGYEVTSKVDLSELTSNVFPEGLPPSYVFVSTQRFKVKKIWDLWRI
LTIDGRPQIAVTLNGVDKILLFTTTSVINGSQVVTFANPQVKTLFDEGWHQIRLLVTEQD
VTLYIDDQQIENKPLHPVLGILINGQTQIGKYSGKEETVQFDVQKLRIYCDPEQNNRETA
CEIPGFNGECLNGPSDVGSTPAPCICPPGKPGLQGPKGDPGLPGNPGYPGQPGQDGKPGY
Q
GIAGTPGVPGSPGIQGARGLPGYKGEPGRDGDKGDRGLPGFPGLHGMPGSKGEMGAKGD
KGSPGFYGKKGAKGEKGN
AGFPGLPGPAGEPGRHGKDGLMGSPGFKGEAGSPGAPGQ
DGT
RGEPGIPGFPGNRGLMGQKGEIGPPGQQGKKGAPGMPGLMGSNGSPGQPGTPGSKGSKGE
PGIQGMPGASGLKGEPGATGSPGEPGYMGLPGIQGKKGDKGNQGEKGIQGQKGENGRQGI
PGQQGIQGH
HGAKGERGEKGEPGVRGAIGSKGESGVDGLMGPAGPKGQPGDPGPQGPPGL
DGKPGREFSE
QFIRQVCTDVIRAQLPVLLQSGRIRNCDHCLSQHGSPGIPGPPGPIGPEG
PRGLPGLPGRDGVPGLVGVPGRPGVRGLKGLPGRNGEKGSQGFG
YPGEQGPPGPPGPEGP
PGISKEGPPGDPGLPGKDGDHGKPGIQGQPGPPGICDPSLCFSVIARRDPFRKGPNY
Sequence length 957
Interactions View interactions
Pathways Pathway information has different metabolic/signaling pathways associated with genes.
  KEGG  Reactome
  Protein digestion and absorption   Collagen biosynthesis and modifying enzymes
Collagen chain trimerization
Associated diseases Disease associations from ClinVar (causal & non-causal) and other databases (OMIM, Orphanet, GWAS, etc.).
5
Evidence Score: ★☆☆☆☆  Gene-disease association found in Text Mining only ★★☆☆☆  Found in Text Mining and Unknown/Other Associations ★★★☆☆  Reported in Unknown/Other Associations across ≥2 Sources ★★★★☆  ClinVar: Pathogenic/Likely Pathogenic (<5 Variants) ★★★★★  ClinVar: Pathogenic/Likely Pathogenic (≥5 Variants)
Unknown / Other Associations ClinVar entries with uncertain/conflicting evidence, and associations from other databases (OMIM, Orphanet, GWAS, etc.) where the gene is not established as causal.
Phenotype Name Clinical Significance Source Evidence Score
CHRONIC OBSTRUCTIVE PULMONARY DISEASE GWAS catalog
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
COLOR VISION DISORDER GWAS catalog
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
PSYCHOSIS GWAS catalog
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
SCHIZOPHRENIA GWAS catalog
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
Associations from Text Mining Disease associations identified through text mining
Disease Name Disease (Merged) Source PMID Relationship Type Evidence Score
Cardiomyopathy Dilated Dilated cardiomyopathy Pubtator 27936202 Associate
★☆☆☆☆
Found in Text Mining only
Fuchs' Endothelial Dystrophy Fuchs endothelial dystrophy Pubtator 28726551 Associate
★☆☆☆☆
Found in Text Mining only
Hypertension Hypertension Pubtator 37451613 Associate
★☆☆☆☆
Found in Text Mining only
Hypertensive disease Hypertension BEFREE 27618447
★☆☆☆☆
Found in Text Mining only
Orofacial Cleft 1 Orofacial cleft Pubtator 26868259, 30924295 Associate
★☆☆☆☆
Found in Text Mining only
Other and unspecified reactive psychosis Reactive Psychosis GWASCAT_DG 24132900
★☆☆☆☆
Found in Text Mining only
Psychotic Disorders Psychosis GWASCAT_DG 24132900
★☆☆☆☆
Found in Text Mining only