Gene Gene information from NCBI Gene database.
Entrez ID 1278
Gene name Collagen type I alpha 2 chain
Gene symbol COL1A2
Synonyms (NCBI Gene)
EDSARTH2EDSCVOI4
Chromosome 7
Chromosome location 7q21.3
Summary This gene encodes the pro-alpha2 chain of type I collagen whose triple helix comprises two alpha1 chains and one alpha2 chain. Type I is a fibril-forming collagen found in most connective tissues and is abundant in bone, cornea, dermis and tendon. Mutatio
SNPs SNP information provided by dbSNP.
207
SNP ID Visualize variation Clinical significance Consequence
rs66612022 G>A,T Pathogenic Missense variant, coding sequence variant
rs66619856 G>A,T Pathogenic Missense variant, coding sequence variant
rs66773001 G>A,T Likely-pathogenic Missense variant, coding sequence variant
rs66820119 G>A,C,T Pathogenic Splice acceptor variant
rs66883877 G>A,C,T Likely-pathogenic Missense variant, coding sequence variant
miRNA miRNA information provided by mirtarbase database.
519
miRTarBase ID miRNA Experiments Reference
MIRT001928 hsa-miR-29c-3p Luciferase reporter assay 18390668
MIRT000472 hsa-let-7g-5p qRT-PCRLuciferase reporter assayWestern blot 20338660
MIRT001928 hsa-miR-29c-3p Luciferase reporter assay 18390668
MIRT001928 hsa-miR-29c-3p ImmunohistochemistryqRT-PCR 21125666
MIRT001928 hsa-miR-29c-3p ImmunohistochemistryqRT-PCR 21125666
Transcription factors Transcription factors information provided by TRRUST V2 database.
17
Transcription factor Regulation Reference
CEBPZ Unknown 8910550
CIITA Repression 16439692
CIITA Unknown 15247294
EP300 Unknown 24058639
FLI1 Repression 24058639
Gene ontology (GO) Gene Ontology (GO) annotations describing the biological processes, molecular functions, and cellular components associated with a gene.
47
GO ID Ontology Definition Evidence Reference
GO:0001501 Process Skeletal system development IMP 8841196, 17955022, 18375391
GO:0001568 Process Blood vessel development IMP 17211858
GO:0002020 Function Protease binding IPI 19932771
GO:0005201 Function Extracellular matrix structural constituent IEA
GO:0005201 Function Extracellular matrix structural constituent NAS 8982144
Other IDs Other IDs provides unique identifiers for this gene in OMIM, HGNC, and Ensembl databases.
MIM HGNC e!Ensembl
120160 2198 ENSG00000164692
Protein Protein information from UniProt database.
UniProt ID Unique identifier for the protein in the UniProt database. Click to view detailed protein information.
P08123
Protein name Collagen alpha-2(I) chain (Alpha-2 type I collagen)
Protein function Type I collagen is a member of group I collagen (fibrillar forming collagen).
PDB 5CTD , 5CTI , 5CVA , 6JEC
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF01391 Collagen 29 82 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 88 150 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 130 207 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 460 529 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 601 665 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1045 1114 Collagen triple helix repeat (20 copies) Repeat
PF01410 COLFI 1131 1365 Fibrillar collagen C-terminal domain Family
Tissue specificity TISSUE SPECIFICITY: Forms the fibrils of tendon, ligaments and bones. In bones the fibrils are mineralized with calcium hydroxyapatite.
Sequence
MLSFVDTRTLLLLAVTLCLATCQSLQEETVRKGPAGDRGPRGERGPPGPPGRDGEDGPTG
PPGPPGPPGPPGLGGNFAAQYD
GKGVGLGPGPMGLMGPRGPPGAAGAPGPQGFQGPAGEP
GEPGQTGPA
GARGPAGPPGKAGEDGHPGKPGRPGERGVVGPQGARGFPGTPGLPGFKGIR
GHNGLDGLKGQPGAPGVKGEPGAPGEN
GTPGQTGARGLPGERGRVGAPGPAGARGSDGSV
GPVGPAGPIGSAGPPGFPGAPGPKGEIGAVGNAGPAGPAGPRGEVGLPGLSGPVGPPGNP
GANGLTGAKGAAGLPGVAGAPGLPGPRGIPGPVGAAGATGARGLVGEPGPAGSKGESGNK
GEPGSAGPQGPPGPSGEEGKRGPNGEAGSAGPPGPPGLRGSPGSRGLPGADGRAGVMGPP
GSRGASGPAGVRGPNGDAGRPGEPGLMGPRGLPGSPGNIGPAGKEGPVGLPGIDGRPGPI
GPAGARGEPGNIGFPGPKGPTGDPGKNGDKGHAGLAGARGAPGPDGNNG
AQGPPGPQGVQ
GGKGEQGPPGPPGFQGLPGPSGPAGEVGKPGERGLHGEFGLPGPAGPRGERGPPGESGAA
GPTGPIGSRGPSGPPGPDGNKGEPGVVGAVGTAGPSGPSGLPGERGAAGIPGGKGEKGEP
GLRGE
IGNPGRDGARGAPGAVGAPGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPA
GPNGFAGPAGAAGQPGAKGERGAKGPKGENGVVGPTGPVGAAGPAGPNGPPGPAGSRGDG
GPPGMTGFPGAAGRTGPPGPSGISGPPGPPGPAGKEGLRGPRGDQGPVGRTGEVGAVGPP
GFAGEKGPSGEAGTAGPPGTPGPQGLLGAPGILGLPGSRGERGLPGVAGAVGEPGPLGIA
GPPGARGPPGAVGSPGVNGAPGEAGRDGNPGNDGPPGRDGQPGHKGERGYPGNIGPVGAA
GAPGPHGPVGPAGKHGNRGETGPSGPVGPAGAVGPRGPSGPQGIRGDKGEPGEKGPRGLP
GLKGHNGLQGLPGIAGHHGDQGAPGSVGPAGPRGPAGPSGPAGKDGRTGHPGTVGPAGIR
GPQGHQGPAGPPGPPGPPGPPGVSGGGYDFGYDG
DFYRADQPRSAPSLRPKDYEVDATLK
SLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTG
ETCIRAQPENIPAKNWYRSSKDKKHVWLGETINAGSQFEYNVEGVTSKEMATQLAFMRLL
ANYASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTVLVDGCSKK
TNEWGKTIIEYKTNKPSRLPFLDIAPLDIGGADQEFFVDIGPVCF
K
Sequence length 1366
Interactions View interactions
Associated diseases Disease associations from ClinVar (causal & non-causal) and other databases (OMIM, Orphanet, GWAS, etc.).
95
Evidence Score: ★☆☆☆☆  Gene-disease association found in Text Mining only ★★☆☆☆  Found in Text Mining and Unknown/Other Associations ★★★☆☆  Reported in Unknown/Other Associations across ≥2 Sources ★★★★☆  ClinVar: Pathogenic/Likely Pathogenic (<5 Variants) ★★★★★  ClinVar: Pathogenic/Likely Pathogenic (≥5 Variants)
Causal Diseases associated with Pathogenic or Likely Pathogenic variants in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession Evidence Score
Abnormality of the skeletal system Likely pathogenic; Pathogenic rs1554395833, rs1054264002 RCV001836846
RCV001814270
★★★★☆
ClinVar: Pathogenic / Likely Pathogenic (<5 Variants)
Bruck syndrome 1 Likely pathogenic rs794727669 RCV002469046
★★★★☆
ClinVar: Pathogenic / Likely Pathogenic (<5 Variants)
Cardiovascular phenotype Likely pathogenic; Pathogenic rs749621872, rs193922168, rs2484702812, rs1114167416, rs72656387, rs72658129, rs1410254723, rs1054264002, rs68063264 RCV002438886
RCV002325721
RCV005323577
RCV002446951
RCV005540011
View all (4 more)
★★★★★
ClinVar: Pathogenic / Likely Pathogenic (≥5 Variants)
COL1A2-related disorder Likely pathogenic; Pathogenic rs749621872, rs72659304, rs2115954840, rs72656386, rs72656378, rs2115921279, rs1381927942, rs2115959456, rs2484725715, rs2115941300, rs2484729533, rs2484737562, rs2484724068, rs72658158, rs2484708250
View all (13 more)
RCV004743444
RCV005867215
RCV003416459
RCV003401970
RCV003418259
View all (23 more)
★★★★★
ClinVar: Pathogenic / Likely Pathogenic (≥5 Variants)
Unknown / Other Associations ClinVar entries with uncertain/conflicting evidence, and associations from other databases (OMIM, Orphanet, GWAS, etc.) where the gene is not established as causal.
Phenotype Name Clinical Significance Source Evidence Score
Adrenocortical carcinoma, hereditary Uncertain significance ClinVar
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
ASTROCYTOMA GWAS catalog
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
BONE FRAGILITY WITH CONTRACTURES, ARTERIAL RUPTURE, AND DEAFNESS CTD
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
BREAST CANCER GWAS catalog
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
Associations from Text Mining Disease associations identified through text mining
Disease Name Disease (Merged) Source PMID Relationship Type Evidence Score
Achondrogenesis type 2 Achondrogenesis BEFREE 8175802
★☆☆☆☆
Found in Text Mining only
Adenocarcinoma Adenocarcinoma BEFREE 28814946
★☆☆☆☆
Found in Text Mining only
Adenocarcinoma of lung (disorder) Lung adenocarcinoma BEFREE 19949890, 30892233
★☆☆☆☆
Found in Text Mining only
Adenoma Adenoma BEFREE 23045325
★☆☆☆☆
Found in Text Mining only
Adult Medulloblastoma Medulloblastoma BEFREE 18664619
★☆☆☆☆
Found in Text Mining only
Alzheimer Disease Alzheimer disease Pubtator 26482433 Associate
★☆☆☆☆
Found in Text Mining only
Amelogenesis Imperfecta Amelogenesis imperfecta BEFREE 17552940
★☆☆☆☆
Found in Text Mining only
Amelogenesis Imperfecta Type IB Amelogenesis imperfecta Pubtator 8702873 Associate
★☆☆☆☆
Found in Text Mining only
Aneurysm Aneurysm Pubtator 27381111 Associate
★☆☆☆☆
Found in Text Mining only
Anodontia Anodontia Pubtator 23227268, 32234057 Associate
★☆☆☆☆
Found in Text Mining only