Gene Gene information from NCBI Gene database.
Entrez ID 1278
Gene name Collagen type I alpha 2 chain
Gene symbol COL1A2
Synonyms (NCBI Gene)
EDSARTH2EDSCVOI4
Chromosome 7
Chromosome location 7q21.3
Summary This gene encodes the pro-alpha2 chain of type I collagen whose triple helix comprises two alpha1 chains and one alpha2 chain. Type I is a fibril-forming collagen found in most connective tissues and is abundant in bone, cornea, dermis and tendon. Mutatio
SNPs SNP information provided by dbSNP.
207
SNP ID Visualize variation Clinical significance Consequence
rs66612022 G>A,T Pathogenic Missense variant, coding sequence variant
rs66619856 G>A,T Pathogenic Missense variant, coding sequence variant
rs66773001 G>A,T Likely-pathogenic Missense variant, coding sequence variant
rs66820119 G>A,C,T Pathogenic Splice acceptor variant
rs66883877 G>A,C,T Likely-pathogenic Missense variant, coding sequence variant
miRNA miRNA information provided by mirtarbase database.
519
miRTarBase ID miRNA Experiments Reference
MIRT001928 hsa-miR-29c-3p Luciferase reporter assay 18390668
MIRT000472 hsa-let-7g-5p qRT-PCRLuciferase reporter assayWestern blot 20338660
MIRT001928 hsa-miR-29c-3p Luciferase reporter assay 18390668
MIRT001928 hsa-miR-29c-3p ImmunohistochemistryqRT-PCR 21125666
MIRT001928 hsa-miR-29c-3p ImmunohistochemistryqRT-PCR 21125666
Transcription factors Transcription factors information provided by TRRUST V2 database.
17
Transcription factor Regulation Reference
CEBPZ Unknown 8910550
CIITA Repression 16439692
CIITA Unknown 15247294
EP300 Unknown 24058639
FLI1 Repression 24058639
Gene ontology (GO) Gene Ontology (GO) annotations describing the biological processes, molecular functions, and cellular components associated with a gene.
47
GO ID Ontology Definition Evidence Reference
GO:0001501 Process Skeletal system development IMP 8841196, 17955022, 18375391
GO:0001568 Process Blood vessel development IMP 17211858
GO:0002020 Function Protease binding IPI 19932771
GO:0005201 Function Extracellular matrix structural constituent IEA
GO:0005201 Function Extracellular matrix structural constituent NAS 8982144
Other IDs Other IDs provides unique identifiers for this gene in OMIM, HGNC, and Ensembl databases.
MIM HGNC e!Ensembl
120160 2198 ENSG00000164692
Protein Protein information from UniProt database.
UniProt ID Unique identifier for the protein in the UniProt database. Click to view detailed protein information.
P08123
Protein name Collagen alpha-2(I) chain (Alpha-2 type I collagen)
Protein function Type I collagen is a member of group I collagen (fibrillar forming collagen).
PDB 5CTD , 5CTI , 5CVA , 6JEC
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF01391 Collagen 29 82 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 88 150 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 130 207 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 460 529 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 601 665 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1045 1114 Collagen triple helix repeat (20 copies) Repeat
PF01410 COLFI 1131 1365 Fibrillar collagen C-terminal domain Family
Tissue specificity TISSUE SPECIFICITY: Forms the fibrils of tendon, ligaments and bones. In bones the fibrils are mineralized with calcium hydroxyapatite.
Sequence
MLSFVDTRTLLLLAVTLCLATCQSLQEETVRKGPAGDRGPRGERGPPGPPGRDGEDGPTG
PPGPPGPPGPPGLGGNFAAQYD
GKGVGLGPGPMGLMGPRGPPGAAGAPGPQGFQGPAGEP
GEPGQTGPA
GARGPAGPPGKAGEDGHPGKPGRPGERGVVGPQGARGFPGTPGLPGFKGIR
GHNGLDGLKGQPGAPGVKGEPGAPGEN
GTPGQTGARGLPGERGRVGAPGPAGARGSDGSV
GPVGPAGPIGSAGPPGFPGAPGPKGEIGAVGNAGPAGPAGPRGEVGLPGLSGPVGPPGNP
GANGLTGAKGAAGLPGVAGAPGLPGPRGIPGPVGAAGATGARGLVGEPGPAGSKGESGNK
GEPGSAGPQGPPGPSGEEGKRGPNGEAGSAGPPGPPGLRGSPGSRGLPGADGRAGVMGPP
GSRGASGPAGVRGPNGDAGRPGEPGLMGPRGLPGSPGNIGPAGKEGPVGLPGIDGRPGPI
GPAGARGEPGNIGFPGPKGPTGDPGKNGDKGHAGLAGARGAPGPDGNNG
AQGPPGPQGVQ
GGKGEQGPPGPPGFQGLPGPSGPAGEVGKPGERGLHGEFGLPGPAGPRGERGPPGESGAA
GPTGPIGSRGPSGPPGPDGNKGEPGVVGAVGTAGPSGPSGLPGERGAAGIPGGKGEKGEP
GLRGE
IGNPGRDGARGAPGAVGAPGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPA
GPNGFAGPAGAAGQPGAKGERGAKGPKGENGVVGPTGPVGAAGPAGPNGPPGPAGSRGDG
GPPGMTGFPGAAGRTGPPGPSGISGPPGPPGPAGKEGLRGPRGDQGPVGRTGEVGAVGPP
GFAGEKGPSGEAGTAGPPGTPGPQGLLGAPGILGLPGSRGERGLPGVAGAVGEPGPLGIA
GPPGARGPPGAVGSPGVNGAPGEAGRDGNPGNDGPPGRDGQPGHKGERGYPGNIGPVGAA
GAPGPHGPVGPAGKHGNRGETGPSGPVGPAGAVGPRGPSGPQGIRGDKGEPGEKGPRGLP
GLKGHNGLQGLPGIAGHHGDQGAPGSVGPAGPRGPAGPSGPAGKDGRTGHPGTVGPAGIR
GPQGHQGPAGPPGPPGPPGPPGVSGGGYDFGYDG
DFYRADQPRSAPSLRPKDYEVDATLK
SLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTG
ETCIRAQPENIPAKNWYRSSKDKKHVWLGETINAGSQFEYNVEGVTSKEMATQLAFMRLL
ANYASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTVLVDGCSKK
TNEWGKTIIEYKTNKPSRLPFLDIAPLDIGGADQEFFVDIGPVCF
K
Sequence length 1366
Interactions View interactions
Pathways Pathway information has different metabolic/signaling pathways associated with genes.
  KEGG  Reactome
  PI3K-Akt signaling pathway
Focal adhesion
ECM-receptor interaction
Platelet activation
Cytoskeleton in muscle cells
Relaxin signaling pathway
AGE-RAGE signaling pathway in diabetic complications
Protein digestion and absorption
Amoebiasis
Human papillomavirus infection
Proteoglycans in cancer
Diabetic cardiomyopathy
  GPVI-mediated activation cascade
Collagen degradation
Extracellular matrix organization
Collagen biosynthesis and modifying enzymes
Immunoregulatory interactions between a Lymphoid and a non-Lymphoid cell
Assembly of collagen fibrils and other multimeric structures
Cell surface interactions at the vascular wall
Integrin cell surface interactions
Anchoring fibril formation
Crosslinking of collagen fibrils
Non-integrin membrane-ECM interactions
ECM proteoglycans
GP1b-IX-V activation signalling
Interleukin-4 and Interleukin-13 signaling
Platelet Adhesion to exposed collagen
Platelet Aggregation (Plug Formation)
MET activates PTK2 signaling
Collagen chain trimerization
Associated diseases Disease associations from ClinVar categorized as Causal (Pathogenic/Likely Pathogenic) or Unknown.
4538
Causal Diseases associated with Pathogenic or Likely Pathogenic variants in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession
- Pathogenic rs72656362 RCV000018796
Abnormality of the skeletal system Likely pathogenic; Pathogenic rs1554395833, rs1054264002 RCV001836846
RCV001814270
Bruck syndrome 1 Likely pathogenic rs794727669 RCV002469046
Cardiovascular phenotype Likely pathogenic; Pathogenic rs749621872, rs193922168, rs2484702812, rs1114167416, rs72656387, rs72658129, rs1410254723, rs1054264002, rs68063264 RCV002438886
RCV002325721
RCV005323577
RCV002446951
RCV005540011
RCV006276142
RCV004992415
RCV002375009
RCV002451621
Unknown Diseases with uncertain, conflicting, or no pathogenic evidence in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession
Adrenocortical carcinoma, hereditary Uncertain significance rs772672797 RCV005909123
Cervical cancer Benign; Conflicting classifications of pathogenicity rs7804898, rs72658163 RCV005915152
RCV005887560
Cholangiocarcinoma Benign rs7804898, rs62464629, rs7805430 RCV005915154
RCV005921507
RCV005904143
Connective tissue disorder Benign; Likely benign; Conflicting classifications of pathogenicity; Uncertain significance rs368468, rs189374343, rs765118884, rs886062514, rs141088934, rs139851311, rs72658163, rs189557655, rs763509640, rs145355907, rs754825427, rs780687409, rs150670521, rs1554394649, rs372678526
View all (9 more)
RCV000659372
RCV000659378
RCV000659365
RCV000680484
RCV000680489
RCV000659381
RCV000680486
RCV000659368
RCV000659367
RCV000680488
RCV000659376
RCV000680483
RCV000680487
RCV000659366
RCV000659369
RCV000659371
RCV000659373
RCV000659374
RCV000659375
RCV000659377
RCV000659379
RCV000659380
RCV000659382
RCV000680485
Associations from Text MiningDisease associations identified through Pubtator
Disease Name Relationship Type References
Adenocarcinoma Associate 26482433, 36171259
Adenocarcinoma of Lung Associate 31074380, 37795779
Adenoma Associate 30175151
Alopecia Associate 16755026
Alzheimer Disease Associate 26482433
Amelogenesis Imperfecta Type IB Associate 8702873
Aneurysm Associate 27381111
Aneurysm Ruptured Associate 34290266
Anodontia Associate 23227268, 32234057
Aortic Valve Disease Associate 32089075