GediPNet logo

COL1A1 (collagen type I alpha 1 chain)

Gene
Entrez ID Entrez Gene ID - the GENE ID in NCBI Gene database.
1277
Gene nameGene Name - the full gene name approved by the HGNC.
Collagen type I alpha 1 chain
Gene symbolGene Symbol - the official gene symbol approved by the HGNC, which is a short abbreviated form of the gene name.
COL1A1
SynonymsGene synonyms aliases
CAFYD, EDSARTH1, EDSC, OI1, OI2, OI3, OI4
ChromosomeChromosome number
17
Chromosome locationChromosomal Location - indicates the cytogenetic location of the gene or region on the chromosome.
17q21.33
SummarySummary of gene provided in NCBI Entrez Gene.
This gene encodes the pro-alpha1 chains of type I collagen whose triple helix comprises two alpha1 chains and one alpha2 chain. Type I is a fibril-forming collagen found in most connective tissues and is abundant in bone, cornea, dermis and tendon. Mutations in this gene are associated with osteogenesis imperfecta types I-IV, Ehlers-Danlos syndrome type VIIA, Ehlers-Danlos syndrome Classical type, Caffey Disease and idiopathic osteoporosis. Reciprocal translocations between chromosomes 17 and 22, where this gene and the gene for platelet-derived growth factor beta are located, are associated with a particular type of skin tumor called dermatofibrosarcoma protuberans, resulting from unregulated expression of the growth factor. Two transcripts, resulting from the use of alternate polyadenylation signals, have been identified for this gene. [provided by R. Dalgleish, Feb 2008]
SNPsSNP information provided by dbSNP.
SNP ID Visualize variation Clinical significance Consequence
rs1800211 C>T Likely-pathogenic, uncertain-significance Coding sequence variant, missense variant, intron variant
rs1800214 G>A,C,T Conflicting-interpretations-of-pathogenicity, uncertain-significance Coding sequence variant, missense variant
rs2586486 G>A,T Pathogenic Coding sequence variant, stop gained, missense variant
rs8179178 C>A,T Pathogenic Coding sequence variant, missense variant
rs34940368 G>A,C Likely-benign, pathogenic, benign Synonymous variant, missense variant, coding sequence variant
miRNAmiRNA information provided by mirtarbase database.
miRTarBase ID miRNA Experiments Reference
MIRT000100 hsa-miR-29b-3p Immunoblot, Luciferase reporter assay, qRT-PCR 19913496
MIRT000100 hsa-miR-29b-3p Luciferase reporter assay, qRT-PCR, Western blot 20657750
MIRT000100 hsa-miR-29b-3p Microarray, qRT-PCR, Western blot 22297492
MIRT000100 hsa-miR-29b-3p Luciferase reporter assay 23478036
MIRT000100 hsa-miR-29b-3p Reporter assay 21769867
Transcription factors
Transcription factor Regulation Reference
CIITA Repression 16439692
ETS1 Unknown 16564026
MKL1 Activation 22049076
MYB Activation 9989795
MYBL2 Unknown 14613485
Gene ontology (GO)Gene ontology information of associated ontologies with gene provided by GO database.
GO ID Ontology Definition Evidence Reference
GO:0001501 Process Skeletal system development IBA 21873635
GO:0001501 Process Skeletal system development IMP 1874719, 8097422, 14976317
GO:0001503 Process Ossification IBA 21873635
GO:0001568 Process Blood vessel development IBA 21873635
GO:0001568 Process Blood vessel development IMP 17211858
Other IDsOther ids provides unique ids of gene in databases such as OMIM, HGNC, ENSEMBLE.
MIM
HGNC
e!Ensembl
Protein
UniProt ID P02452
Protein name Collagen alpha-1(I) chain (Alpha-1 type I collagen)
Protein function Type I collagen is a member of group I collagen (fibrillar forming collagen).
PDB 1Q7D , 2LLP , 3EJH , 3GXE , 5CTD , 5CTI , 5CVA , 5CVB , 5K31 , 5OU8 , 5OU9
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF00093 VWC
40 95
von Willebrand factor type C domain
Family
PF01391 Collagen
107 163
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
177 238
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
236 295
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
296 355
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
356 415
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
407 476
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
779 838
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
835 898
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
1013 1080
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
1076 1138
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
1133 1195
Collagen triple helix repeat (20 copies)
Repeat
PF01410 COLFI
1227 1463
Fibrillar collagen C-terminal domain
Family
Sequence
MFSFVDLRLLLLLAATALLTHGQEEGQVEGQDEDIPPITCVQNGLRYHDRDVWKPEPCRI
CVCDNGKVLCDDVICDETKNCPGAEVPEGECCPVC
PDGSESPTDQETTGVEGPKGDTGPR
GPRGPAGPPGRDGIPGQPGLPGPPGPPGPPGPPGLGGNFAPQL
SYGYDEKSTGGISVPGP
MGPSGPRGLPGPPGAPGPQGFQGPPGEPGEPGASGPMGPRGPPGPPGKNGDDGEA
GKPGR
PGERGPPGPQGARGLPGTAGLPGMKGHRGFSGLDGAKGDAGPAGPKGEPGSPGEN
GAPGQ
MGPRGLPGERGRPGAPGPAGARGNDGATGAAGPPGPTGPAGPPGFPGAVGAKGEA
GPQGP
RGSEGPQGVRGEPGPPGPAGAAGPAGNPGADGQPGAKGANGAPGIA
GAPGFPGARGPSGP
QGPGGPPGPKGNSGEPGAPGSKGDTGAKGEPGPVGVQGPPGPAGEEGKRGARGEPG
PTGL
PGPPGERGGPGSRGFPGADGVAGPKGPAGERGSPGPAGPKGSPGEAGRPGEAGLPGAKGL
TGSPGSPGPDGKTGPPGPAGQDGRPGPPGPPGARGQAGVMGFPGPKGAAGEPGKAGERGV
PGPPGAVGPAGKDGEAGAQGPPGPAGPAGERGEQGPAGSPGFQGLPGPAGPPGEAGKPGE
QGVPGDLGAPGPSGARGERGFPGERGVQGPPGPAGPRGANGAPGNDGAKGDAGAPGAPGS
QGAPGLQGMPGERGAAGLPGPKGDRGDAGPKGADGSPGKDGVRGLTGPIGPPGPAGAPGD
KGESGPSGPAGPTGARGAPGDRGEPGPPGPAGFAGPPGADGQPGAKGEPGDAGA
KGDAGP
PGPAGPAGPPGPIGNVGAPGAKGARGSAGPPGATGFPGAAGRVGPPGPSGNAGPPGPP
GP
AGKEGGKGPRGETGPAGRPGEVGPPGPPGPAGEKGSPGADGPAGAPGTPGPQGIAGQRGV
VGLPGQRGERGFPGLPGPSGEPGKQGPSGASGERGPPGPMGPPGLAGPPGESGREGAPGA
EGSPGRDGSPGAKGDRGETGPAGPPGAPGAPGAPGPVGPAGKSGDRGETGPAGPT
GPVGP
VGARGPAGPQGPRGDKGETGEQGDRGIKGHRGFSGLQGPPGPPGSPGEQGPSGASGPAGP
RGPPGSAGAPGKDGLNGLPGPIGPPGPRGRTGDAGPVGPPGPPGPPGPPGPPSAG
FDFSF
LPQPPQEKAHDGGRYYRADDANVVRDRDLEVDTTLKSLSQQIENIRSPEGSRKNPARTCR
DLKMCHSDWKSGEYWIDPNQGCNLDAIKVFCNMETGETCVYPTQPSVAQKNWYISKNPKD
KRHVWFGESMTDGFQFEYGGQGSDPADVAIQLTFLRLMSTEASQNITYHCKNSVAYMDQQ
TGNLKKALLLQGSNEIEIRAEGNSRFTYSVTVDGCTSHTGAWGKTVIEYKTTKTSRLPII
DVAPLDVGAPDQEFGFDVGPVCF
L
Sequence length 1464
Interactions View interactions

| © 2021, Biomedical Informatics Centre, NIRRH |
ICMR-National Institute for Research in Reproductive Health, Jehangir Merwanji Street, Parel, Mumbai-400012
Tel: +91-22-24192104, Fax No: +91-22-24139412