Gene Gene information from NCBI Gene database.
Entrez ID 1280
Gene name Collagen type II alpha 1 chain
Gene symbol COL2A1
Synonyms (NCBI Gene)
ACG2ANFHANFH1AOMCOL11A3EDMMDLCPDOSCDPPLSDTSEDCSEDSTNSEMDSTWKSMDALGSTL1VPED
Chromosome 12
Chromosome location 12q13.11
Summary This gene encodes the alpha-1 chain of type II collagen, a fibrillar collagen found in cartilage and the vitreous humor of the eye. Mutations in this gene are associated with achondrogenesis, chondrodysplasia, early onset familial osteoarthritis, SED cong
SNPs SNP information provided by dbSNP.
178
SNP ID Visualize variation Clinical significance Consequence
rs121912864 C>T Pathogenic Coding sequence variant, missense variant
rs121912868 C>T Pathogenic Coding sequence variant, missense variant
rs121912869 G>A Pathogenic Coding sequence variant, stop gained
rs121912870 C>T Pathogenic-likely-pathogenic, pathogenic Coding sequence variant, missense variant
rs121912871 C>T Pathogenic Coding sequence variant, missense variant
miRNA miRNA information provided by mirtarbase database.
2
miRTarBase ID miRNA Experiments Reference
MIRT048314 hsa-miR-106a-5p CLASH 23622248
MIRT053741 hsa-miR-29b-3p Microarray 22942087
Transcription factors Transcription factors information provided by TRRUST V2 database.
9
Transcription factor Regulation Reference
NFKB1 Repression 19022820
RELA Repression 19022820
SOX9 Activation 18636947;20529846
SOX9 Repression 12713737
SOX9 Unknown 15623506;18759300;20039424
Gene ontology (GO) Gene Ontology (GO) annotations describing the biological processes, molecular functions, and cellular components associated with a gene.
65
GO ID Ontology Definition Evidence Reference
GO:0001501 Process Skeletal system development IMP 1429602
GO:0001502 Process Cartilage condensation IEA
GO:0001503 Process Ossification IEA
GO:0001894 Process Tissue homeostasis IEA
GO:0001958 Process Endochondral ossification IEA
Other IDs Other IDs provides unique identifiers for this gene in OMIM, HGNC, and Ensembl databases.
MIM HGNC e!Ensembl
120140 2200 ENSG00000139219
Protein Protein information from UniProt database.
UniProt ID Unique identifier for the protein in the UniProt database. Click to view detailed protein information.
P02458
Protein name Collagen alpha-1(II) chain (Alpha-1 type II collagen) [Cleaved into: Collagen alpha-1(II) chain; Chondrocalcin]
Protein function Type II collagen is specific for cartilaginous tissues. It is essential for the normal embryonic development of the skeleton, for linear growth and for the ability of cartilage to resist compressive forces.
PDB 1U5M , 2FSE , 2SEB , 5NIR , 5OCX , 5OCY , 6BIN , 6HG7 , 6NIX , 7NZE
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF00093 VWC 34 89 von Willebrand factor type C domain Family
PF01391 Collagen 116 182 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 199 260 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 237 317 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 429 498 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 792 860 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1158 1217 Collagen triple helix repeat (20 copies) Repeat
PF01410 COLFI 1251 1486 Fibrillar collagen C-terminal domain Family
Tissue specificity TISSUE SPECIFICITY: Isoform 2 is highly expressed in juvenile chondrocyte and low in fetal chondrocyte. {ECO:0000269|PubMed:2355003}.
Sequence
MIRLGAPQTLVLLTLLVAAVLRCQGQDVQEAGSCVQDGQRYNDKDVWKPEPCRICVCDTG
TVLCDDIICEDVKDCLSPEIPFGECCPIC
PTDLATASGQPGPKGQKGEPGDIKDIVGPKG
PPGPQGPAGEQGPRGDRGDKGEKGAPGPRGRDGEPGTPGNPGPPGPPGPPGPPGLGGNFA
AQ
MAGGFDEKAGGAQLGVMQGPMGPMGPRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPMG
PRGPPGPPGKPGDDGEAGKP
GKAGERGPPGPQGARGFPGTPGLPGVKGHRGYPGLDGAKG
EAGAPGVKGESGSPGEN
GSPGPMGPRGLPGERGRTGPAGAAGARGNDGQPGPAGPPGPVG
PAGGPGFPGAPGAKGEAGPTGARGPEGAQGPRGEPGTPGSPGPAGASGNPGTDGIPGAKG
SAGAPGIAGAPGFPGPRGPPGPQGATGPLGPKGQTGEPGIAGFKGEQGPKGEPGPAGPQG
APGPAGEEGKRGARGEPG
GVGPIGPPGERGAPGNRGFPGQDGLAGPKGAPGERGPSGLAG
PKGANGDPGRPGEPGLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQPG
VMGFPGPKGANGEPGKAGEKGLPGAPGLRGLPGKDGETGAAGPPGPAGPAGERGEQGAPG
PSGFQGLPGPPGPPGEGGKPGDQGVPGEAGAPGLVGPRGERGFPGERGSPGAQGLQGPRG
LPGTPGTDGPKGASGPAGPPGAQGPPGLQGMPGERGAAGIAGPKGDRGDVGEKGPEGAPG
KDGGRGLTGPIGPPGPAGANGEKGEVGPPGPAGSAGARGAPGERGETGPPGPAGFAGPPG
ADGQPGAKGEQGEAGQKGDA
GAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPG
AAGRVGPPGSNGNPGPPGPPGPSGKDGPKGARGDSGPPGRAGEPGLQGPAGPPGEKGEPG
DDGPSGAEGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPGASGDRGPPG
PVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGAVGAPGAPGPPGSPGPAG
PTGKQGDRGEAGAQGPMGPSGPAGARGIQGPQGPRGDKGEAGEPGERGLKGHRGFTGLQG
LPGPPGPSGDQGASGPAGPSGPRGPPGPVGPSGKDGANGIPGPIGPPGPRGRSGETGPAG
PPGNPGPPGPPGPPGPG
IDMSAFAGLGPREKGPDPLQYMRADQAAGGLRQHDAEVDATLK
SLNNQIESIRSPEGSRKNPARTCRDLKLCHPEWKSGDYWIDPNQGCTLDAMKVFCNMETG
ETCVYPNPANVPKKNWWSSKSKEKKHIWFGETINGGFHFSYGDDNLAPNTANVQMTFLRL
LSTEGSQNITYHCKNSIAYLDEAAGNLKKALLIQGSNDVEIRAEGNSRFTYTALKDGCTK
HTGKWGKTVIEYRSQKTSRLPIIDIAPMDIGGPEQEFGVDIGPVCF
L
Sequence length 1487
Interactions View interactions
Pathways Pathway information has different metabolic/signaling pathways associated with genes.
  KEGG  Reactome
  PI3K-Akt signaling pathway
Focal adhesion
ECM-receptor interaction
Protein digestion and absorption
Human papillomavirus infection
  Collagen degradation
Extracellular matrix organization
Collagen biosynthesis and modifying enzymes
Signaling by PDGF
Immunoregulatory interactions between a Lymphoid and a non-Lymphoid cell
Assembly of collagen fibrils and other multimeric structures
Integrin cell surface interactions
Non-integrin membrane-ECM interactions
ECM proteoglycans
NCAM1 interactions
MET activates PTK2 signaling
Collagen chain trimerization
Associated diseases Disease associations from ClinVar categorized as Causal (Pathogenic/Likely Pathogenic) or Unknown.
1076
Causal Diseases associated with Pathogenic or Likely Pathogenic variants in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession
Abnormality of the skeletal system Likely pathogenic; Pathogenic rs1555168505 RCV001814182
Absent vertebral body mineralization Pathogenic rs765795867 RCV000415214
Acetabular dysplasia Pathogenic rs121912876 RCV003228897
Achondrogenesis type II Likely pathogenic; Pathogenic rs2136526614, rs2136522964, rs2136577158, rs2136551606, rs2136526244, rs1447463543, rs2136526515, rs2136576211, rs2540091565, rs2540132223, rs2540151788, rs2540129721, rs886044555, rs2540133960, rs1555165336
View all (19 more)
RCV003388006
RCV003323909
RCV001808940
RCV001822977
RCV002227895
RCV002260533
RCV002273087
RCV002279899
RCV003151699
RCV003157996
RCV003236501
RCV003328140
RCV003987496
RCV003447778
RCV003485995
RCV003988408
RCV003323361
RCV000018915
RCV000022480
RCV001197973
RCV000022481
RCV000022483
RCV004555254
RCV001196300
RCV001196261
RCV003330729
RCV000505580
RCV001198649
RCV000578377
RCV001822860
RCV000781310
RCV000781309
RCV002265925
RCV001199202
Unknown Diseases with uncertain, conflicting, or no pathogenic evidence in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession
Abnormal cartilage collagen Uncertain significance rs2136512203 RCV001541896
Cervical cancer Benign rs76104937 RCV005900025
Congenital aneurysm of ascending aorta Uncertain significance rs758162798 RCV004555526
EBV-positive nodal T- and NK-cell lymphoma Likely benign rs2540159066 RCV004560200
Associations from Text MiningDisease associations identified through Pubtator
Disease Name Relationship Type References
Achondrogenesis type 2 Associate 3195588, 34573377, 36939200, 7741714, 7829510
Achondroplasia Associate 3005580, 9258750
Albinism Associate 39596324
Alzheimer Disease Associate 32854783
Arrhythmogenic Right Ventricular Dysplasia Associate 35791160
Arthritis Associate 20131279
Arthritis Rheumatoid Inhibit 35955467
Arthropathy progressive pseudorheumatoid of childhood Associate 26183434
Ascending aortic aneurysm hypertelorism bifid uvula cleft palate and arterial tortuosity Associate 30170566
Astrocytoma Associate 25521223