GediPNet logo

COL1A2 (collagen type I alpha 2 chain)

Gene
Entrez ID Entrez Gene ID - the GENE ID in NCBI Gene database.
1278
Gene nameGene Name - the full gene name approved by the HGNC.
Collagen type I alpha 2 chain
Gene symbolGene Symbol - the official gene symbol approved by the HGNC, which is a short abbreviated form of the gene name.
COL1A2
SynonymsGene synonyms aliases
EDSARTH2, EDSCV, OI4
ChromosomeChromosome number
7
Chromosome locationChromosomal Location - indicates the cytogenetic location of the gene or region on the chromosome.
7q21.3
SummarySummary of gene provided in NCBI Entrez Gene.
This gene encodes the pro-alpha2 chain of type I collagen whose triple helix comprises two alpha1 chains and one alpha2 chain. Type I is a fibril-forming collagen found in most connective tissues and is abundant in bone, cornea, dermis and tendon. Mutations in this gene are associated with osteogenesis imperfecta types I-IV, Ehlers-Danlos syndrome type VIIB, recessive Ehlers-Danlos syndrome Classical type, idiopathic osteoporosis, and atypical Marfan syndrome. Symptoms associated with mutations in this gene, however, tend to be less severe than mutations in the gene for the alpha1 chain of type I collagen (COL1A1) reflecting the different role of alpha2 chains in matrix integrity. Three transcripts, resulting from the use of alternate polyadenylation signals, have been identified for this gene. [provided by R. Dalgleish, Feb 2008]
SNPsSNP information provided by dbSNP.
SNP ID Visualize variation Clinical significance Consequence
rs66612022 G>A,T Pathogenic Missense variant, coding sequence variant
rs66619856 G>A,T Pathogenic Missense variant, coding sequence variant
rs66773001 G>A,T Likely-pathogenic Missense variant, coding sequence variant
rs66820119 G>A,C,T Pathogenic Splice acceptor variant
rs66883877 G>A,C,T Likely-pathogenic Missense variant, coding sequence variant
miRNAmiRNA information provided by mirtarbase database.
miRTarBase ID miRNA Experiments Reference
MIRT000472 hsa-let-7g-5p qRT-PCR, Luciferase reporter assay, Western blot 20338660
MIRT001928 hsa-miR-29c-3p Luciferase reporter assay, Reporter assay;Other 18390668
MIRT001928 hsa-miR-29c-3p Immunohistochemistry, qRT-PCR 21125666
MIRT001928 hsa-miR-29c-3p Microarray, qRT-PCR 24590289
MIRT569997 hsa-miR-6803-5p PAR-CLIP 20371350
Transcription factors
Transcription factor Regulation Reference
CEBPZ Unknown 8910550
CIITA Repression 16439692
CIITA Unknown 15247294
EP300 Unknown 24058639
FLI1 Repression 24058639
Gene ontology (GO)Gene ontology information of associated ontologies with gene provided by GO database.
GO ID Ontology Definition Evidence Reference
GO:0001501 Process Skeletal system development IMP 8841196, 17955022, 18375391
GO:0001568 Process Blood vessel development IMP 17211858
GO:0002020 Function Protease binding IPI 19932771
GO:0005201 Function Extracellular matrix structural constituent IBA 21873635
GO:0005201 Function Extracellular matrix structural constituent NAS 8982144
Other IDsOther ids provides unique ids of gene in databases such as OMIM, HGNC, ENSEMBLE.
MIM
HGNC
e!Ensembl
Protein
UniProt ID P08123
Protein name Collagen alpha-2(I) chain (Alpha-2 type I collagen)
Protein function Type I collagen is a member of group I collagen (fibrillar forming collagen).
PDB 5CTD , 5CTI , 5CVA , 6JEC
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF01391 Collagen
29 82
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
88 150
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
130 207
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
460 529
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
601 665
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
1045 1114
Collagen triple helix repeat (20 copies)
Repeat
PF01410 COLFI
1131 1365
Fibrillar collagen C-terminal domain
Family
Sequence
MLSFVDTRTLLLLAVTLCLATCQSLQEETVRKGPAGDRGPRGERGPPGPPGRDGEDGPTG
PPGPPGPPGPPGLGGNFAAQYD
GKGVGLGPGPMGLMGPRGPPGAAGAPGPQGFQGPAGEP
GEPGQTGPA
GARGPAGPPGKAGEDGHPGKPGRPGERGVVGPQGARGFPGTPGLPGFKGIR
GHNGLDGLKGQPGAPGVKGEPGAPGEN
GTPGQTGARGLPGERGRVGAPGPAGARGSDGSV
GPVGPAGPIGSAGPPGFPGAPGPKGEIGAVGNAGPAGPAGPRGEVGLPGLSGPVGPPGNP
GANGLTGAKGAAGLPGVAGAPGLPGPRGIPGPVGAAGATGARGLVGEPGPAGSKGESGNK
GEPGSAGPQGPPGPSGEEGKRGPNGEAGSAGPPGPPGLRGSPGSRGLPGADGRAGVMGPP
GSRGASGPAGVRGPNGDAGRPGEPGLMGPRGLPGSPGNIGPAGKEGPVGLPGIDGRPGPI
GPAGARGEPGNIGFPGPKGPTGDPGKNGDKGHAGLAGARGAPGPDGNNG
AQGPPGPQGVQ
GGKGEQGPPGPPGFQGLPGPSGPAGEVGKPGERGLHGEFGLPGPAGPRGERGPPGESGAA
GPTGPIGSRGPSGPPGPDGNKGEPGVVGAVGTAGPSGPSGLPGERGAAGIPGGKGEKGEP
GLRGE
IGNPGRDGARGAPGAVGAPGPAGATGDRGEAGAAGPAGPAGPRGSPGERGEVGPA
GPNGFAGPAGAAGQPGAKGERGAKGPKGENGVVGPTGPVGAAGPAGPNGPPGPAGSRGDG
GPPGMTGFPGAAGRTGPPGPSGISGPPGPPGPAGKEGLRGPRGDQGPVGRTGEVGAVGPP
GFAGEKGPSGEAGTAGPPGTPGPQGLLGAPGILGLPGSRGERGLPGVAGAVGEPGPLGIA
GPPGARGPPGAVGSPGVNGAPGEAGRDGNPGNDGPPGRDGQPGHKGERGYPGNIGPVGAA
GAPGPHGPVGPAGKHGNRGETGPSGPVGPAGAVGPRGPSGPQGIRGDKGEPGEKGPRGLP
GLKGHNGLQGLPGIAGHHGDQGAPGSVGPAGPRGPAGPSGPAGKDGRTGHPGTVGPAGIR
GPQGHQGPAGPPGPPGPPGPPGVSGGGYDFGYDG
DFYRADQPRSAPSLRPKDYEVDATLK
SLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMDAIKVYCDFSTG
ETCIRAQPENIPAKNWYRSSKDKKHVWLGETINAGSQFEYNVEGVTSKEMATQLAFMRLL
ANYASQNITYHCKNSIAYMDEETGNLKKAVILQGSNDVELVAEGNSRFTYTVLVDGCSKK
TNEWGKTIIEYKTNKPSRLPFLDIAPLDIGGADQEFFVDIGPVCF
K
Sequence length 1366
Interactions View interactions

| © 2021, Biomedical Informatics Centre, NIRRH |
ICMR-National Institute for Research in Reproductive Health, Jehangir Merwanji Street, Parel, Mumbai-400012
Tel: +91-22-24192104, Fax No: +91-22-24139412