Gene
Entrez ID Entrez Gene ID - the GENE ID in NCBI Gene database.
1303
Gene name Gene Name - the full gene name approved by the HGNC.
Collagen type XII alpha 1 chain
Gene symbol Gene Symbol - the official gene symbol approved by the HGNC.
COL12A1
Synonyms (NCBI Gene) Gene synonyms aliases
BA209D8.1, BTHLM2, COL12A1L, DJ234P15.1, EDSMYP, UCMD2
Disease Acronyms (UniProt) Disease acronyms from UniProt database
BTHLM2, UCMD2
Chromosome Chromosome number
6
Chromosome location Chromosomal Location - indicates the cytogenetic location of the gene or region on the chromosome.
6q13-q14.1
Summary Summary of gene provided in NCBI Entrez Gene.
This gene encodes the alpha chain of type XII collagen, a member of the FACIT (fibril-associated collagens with interrupted triple helices) collagen family. Type XII collagen is a homotrimer found in association with type I collagen, an association that i
SNPs SNP information provided by dbSNP.
SNP ID Visualize variation Clinical significance Consequence
rs41266761 C>T Likely-benign, conflicting-interpretations-of-pathogenicity Coding sequence variant, missense variant
rs151324784 C>T Conflicting-interpretations-of-pathogenicity Coding sequence variant, missense variant
rs200201449 C>T Conflicting-interpretations-of-pathogenicity Coding sequence variant, missense variant
rs200443479 G>A,T Conflicting-interpretations-of-pathogenicity Coding sequence variant, intron variant, missense variant
rs200487396 G>A,T Uncertain-significance, pathogenic Coding sequence variant, missense variant
miRNA miRNA information provided by mirtarbase database.
miRTarBase ID miRNA Experiments Reference
MIRT020009 hsa-miR-375 Microarray 20215506
MIRT021464 hsa-miR-9-5p Microarray 17612493
MIRT023070 hsa-miR-124-3p Proteomics;Microarray 18668037
MIRT024050 hsa-miR-1-3p Proteomics;Microarray 18668037
MIRT438071 hsa-miR-26b-5p pSILAC 23939832
Gene ontology (GO) Gene ontology information of associated ontologies with gene provided by GO database.
GO ID Ontology Definition Evidence Reference
GO:0005576 Component Extracellular region HDA 27068509
GO:0005576 Component Extracellular region TAS
GO:0005595 Component Collagen type XII trimer NAS 1427837
GO:0005615 Component Extracellular space HDA 20551380
GO:0005615 Component Extracellular space IBA 21873635
Other IDs Other ids provides unique ids of gene in databases such as OMIM, HGNC, ENSEMBLE.
MIM HGNC e!Ensembl
120320 2188 ENSG00000111799
Protein
UniProt ID Q99715
Protein name Collagen alpha-1(XII) chain
Protein function Type XII collagen interacts with type I collagen-containing fibrils, the COL1 domain could be associated with the surface of the fibrils, and the COL2 and NC3 domains may be localized in the perifibrillar matrix.
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF00041 fn3 26 105 Fibronectin type III domain Domain
PF00092 VWA 140 311 von Willebrand factor type A domain Domain
PF00041 fn3 335 416 Fibronectin type III domain Domain
PF00092 VWA 440 611 von Willebrand factor type A domain Domain
PF00041 fn3 633 712 Fibronectin type III domain Domain
PF00041 fn3 724 801 Fibronectin type III domain Domain
PF00041 fn3 815 894 Fibronectin type III domain Domain
PF00041 fn3 906 986 Fibronectin type III domain Domain
PF00041 fn3 999 1077 Fibronectin type III domain Domain
PF00041 fn3 1088 1168 Fibronectin type III domain Domain
PF00092 VWA 1199 1370 von Willebrand factor type A domain Domain
PF00041 fn3 1386 1465 Fibronectin type III domain Domain
PF00041 fn3 1475 1556 Fibronectin type III domain Domain
PF00041 fn3 1567 1646 Fibronectin type III domain Domain
PF00041 fn3 1656 1734 Fibronectin type III domain Domain
PF00041 fn3 1754 1835 Fibronectin type III domain Domain
PF00041 fn3 1845 1926 Fibronectin type III domain Domain
PF00041 fn3 1937 2016 Fibronectin type III domain Domain
PF00041 fn3 2026 2108 Fibronectin type III domain Domain
PF00041 fn3 2117 2195 Fibronectin type III domain Domain
PF00041 fn3 2205 2283 Fibronectin type III domain Domain
PF00092 VWA 2323 2495 von Willebrand factor type A domain Domain
PF01391 Collagen 2745 2802 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 2800 2853 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 2844 2902 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 2937 2994 Collagen triple helix repeat (20 copies) Repeat
Tissue specificity TISSUE SPECIFICITY: Found in collagen I-containing tissues: both isoform 1 and isoform 2 appear in amnion, chorion, skeletal muscle, small intestine, and in cell culture of dermal fibroblasts, keratinocytes and endothelial cells. Only isoform 2 is found i
Sequence
MRSRLPPALAALGAALLLSSIEAEVDPPSDLNFKIIDENTVHMSWAKPVDPIVGYRITVD
PTTDGPTKEFTLSASTTETLLSELVPETEYVVTITSYDEVEESVP
VIGQLTIQTGSSTKP
VEKKPGKTEIQKCSVSAWTDLVFLVDGSWSVGRNNFKYILDFIAALVSAFDIGEEKTRVG
VVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLVKNTFTESAGARVG
FPKVAIIITDGKSQDEVEIPARELRNVGVEVFSLGIKAADAKELKQIASTPSLNHVFNVA
NFDAIVDIQNE
IISQVCSGVDEQLGELVSGEEVVEPPSNLIAMEVSSKYVKLNWNPSPSP
VTGYKVILTPMTAGSRQHALSVGPQTTTLSVRDLSADTEYQISVSAMKGMTSSEPI
SIME
KTQPMKVQVECSRGVDIKADIVFLVDGSYSIGIANFVKVRAFLEVLVKSFEISPNRVQIS
LVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFVPSKGSRSN
VPKVMILITDGKSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSELEAIASPPAETHVFTVE
DFDAFQRISFE
LTQSICLRIEQELAAIKKKAYVPPKDLSFSEVTSYGFKTNWSPAGENVF
SYHITYKEAAGDDEVTVVEPASSTSVVLSSLKPETLYLVNVTAEYEDGFSIP
LAGEETTE
EVKGAPRNLKVTDETTDSFKITWTQAPGRVLRYRIIYRPVAGGESREVTTPPNQRRRTLE
NLIPDTKYEVSVIPEYFSGPG
TPLTGNAATEEVRGNPRDLRVSDPTTSTMKLSWSGAPGK
VKQYLVTYTPVAGGETQEVTVRGDTTNTVLQGLKEGTQYALSVTALYASGAGDA
LFGEGT
TLEERGSPQDLVTKDITDTSIGAYWTSAPGMVRGYRVSWKSLYDDVDTGEKNLPEDAIHT
MIENLQPETKYRISVFATYSSGEGEP
LTGDATTELSQDSKTLKVDEETENTMRVTWKPAP
GKVVNYRVVYRPHGRGKQMVAKVPPTVTSTVLKRLQPQTTYDITVLPIYKMGEGKLR
QGS
GTTASRFKSPRNLKTSDPTMSSFRVTWEPAPGEVKGYKVTFHPTGDDRRLGELVVGPYDN
TVVLEELRAGTTYKVNVFGMFDGGESSP
LVGQEMTTLSDTTVMPILSSGMECLTRAEADI
VLLVDGSWSIGRANFRTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDKK
SLLQAVANLPYKGGNTLTGMALNFIRQQNFRTQAGMRPRARKIGVLITDGKSQDDVEAPS
KKLKDEGVELFAIGIKNADEVELKMIATDPDDTHAYNVADFESLSRIVDD
LTINLCNSVK
GPGDLEAPSNLVISERTHRSFRVSWTPPSDSVDRYKVEYYPVSGGKRQEFYVSRMETSTV
LKDLKPETEYVVNVYSVVEDEYSEP
LKGTEKTLPVPVVSLNIYDVGPTTMHVQWQPVGGA
TGYILSYKPVKDTEPTRPKEVRLGPTVNDMQLTDLVPNTEYAVTVQAVLHDLTSEP
VTVR
EVTLPLPRPQDLKLRDVTHSTMNVFWEPVPGKVRKYIVRYKTPEEDVKEVEVDRSETSTS
LKDLFSQTLYTVSVSAVHDEGESPPV
TAQETTRPVPAPTNLKITEVTSEGFRGTWDHGAS
DVSLYRITWAPFGSSDKMETILNGDENTLVFENLNPNTIYEVSITAIYPDESES
DDLIGS
ERTLPILTTQAPKSGPRNLQVYNATSNSLTVKWDPASGRVQKYRITYQPSTGEGNEQTTT
IGGRQNSVVLQKLKPDTPYTITVSSLYPDGEGGRM
TGRGKTKPLNTVRNLRVYDPSTSTL
NVRWDHAEGNPRQYKLFYAPAAGGPEELVPIPGNTNYAILRNLQPDTSYTVTVVPVYTEG
DGGRTS
DTGRTLMRGLARNVQVYNPTPNSLDVRWDPAPGPVLQYRVVYSPVDGTRPSESI
VVPGNTRMVHLERLIPDTLYSVNLVALYSDGEGNPS
PAQGRTLPRSGPRNLRVFGETTNS
LSVAWDHADGPVQQYRIIYSPTVGDPIDEYTTVPGRRNNVILQPLQPDTPYKITVIAVYE
DGDGGHLT
GNGRTVGLLPPQNIHISDEWYTRFRVSWDPSPSPVLGYKIVYKPVGSNEPME
AFVGEMTSYTLHNLNPSTTYDVNVYAQYDSGLSVP
LTDQGTTLYLNVTDLKTYQIGWDTF
CVKWSPHRAATSYRLKLSPADGTRGQEITVRGSETSHCFTGLSPDTDYGVTVFVQTPNLE
GPG
VSVKEHTTVKPTEAPTEPPTPPPPPTIPPARDVCKGAKADIVFLTDASWSIGDDNFN
KVVKFIFNTVGGFDEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRYRGGN
TRTGKALTFIKEKVLTWESGMRKNVPKVLVVVTDGRSQDEVKKAALVIQQSGFSVFVVGV
ADVDYNELANIASKPSERHVFIVDDFESFEKIEDN
LITFVCETATSSCPLIYLDGYTSPG
FKMLEAYNLTEKNFASVQGVSLESGSFPSYSAYRIQKNAFVNQPTADLHPNGLPPSYTII
LLFRLLPETPSDPFAIWQITDRDYKPQVGVIADPSSKTLSFFNKDTRGEVQTVTFDTEEV
KTLFYGSFHKVHIVVTSKSVKIYIDCYEIIEKDIKEAGNITTDGYEILGKLLKGERKSAA
FQIQSFDIVCSPVWTSRDRCCDIPSRRDEGKCPAFPNSCTCTQDSVGPPGPPGPAGGPGA
KGPRGERGISGAIGPPGPRGDIGPPGPQGPPGPQGPNGL
SIPGEQGRQGMKGDAGEPGLP
GRTGTPGLPGPPGPMGPPGDRGFTGKDGAMGPRGPPGPPGSPGSPGVTGPSGKPGKPGDH
GRPGPSGLKGEKGDRGDIASQN
MMRAVARQVCEQLISGQMNRFNQMLNQIPNDYQSSRNQ
PGPPGPPGPPGSAGARGEPGPGGRPGFPGTPGMQGPPGERGLPGEKGERGTGSS
GPRGLP
GPPGPQGESRTGPPGSTGSRGPPGPPGRPGNSGIRGPPGPPGYCDSSQCASIPYNGQGYP
GSG
Sequence length 3063
Interactions View interactions
Pathways Pathway information has different metabolic/signaling pathways associated with genes.
  KEGG   Reactome
  Protein digestion and absorption   Collagen degradation
Collagen biosynthesis and modifying enzymes
Assembly of collagen fibrils and other multimeric structures
Collagen chain trimerization
Associated diseases Disease information provided by ClinVar, GenCC, and GWAS databases.
Causal
Disease term Disease name dbSNP ID References
Alzheimer disease Alzheimer`s Disease rs63750215, rs28936379, rs63749851, rs63749884, rs28936380, rs63750048, rs63750579, rs63750264, rs63749964, rs63750671, rs281865161, rs63750066, rs63750399, rs63750734, rs63751039
View all (65 more)
26830138
Bethlem myopathy BETHLEM MYOPATHY 2, BETHLEM MYOPATHY 1, Bethlem myopathy rs121434553, rs121434555, rs139260335, rs796052093, rs796052094, rs1768488927 24334769, 24334604, 27348394, 27159402
Congenital muscular dystrophy Congenital muscular dystrophy (disorder), Congenital muscular dystrophy, Ullrich type rs1565627745, rs774985604, rs1565629479, rs121913572, rs121913575, rs121913576, rs58932704, rs58912633, rs121912495, rs121912496, rs60458016, rs387906881, rs750764003, rs387907068, rs786205117
View all (45 more)
Microcephaly, epilepsy, and diabetes syndrome MICROCEPHALY, EPILEPSY, AND DIABETES SYNDROME rs387907012 27348394
Unknown
Disease term Disease name Evidence References Source
Endometriosis Endometriosis 28881265 ClinVar, GWAS
Bethlem Myopathy Bethlem myopathy GenCC
Ullrich Congenital Muscular Dystrophy Ullrich congenital muscular dystrophy 2 GenCC
Melanoma Melanoma SMAD3 and SLC9A5 gain?of?function increases invasion capability of melanoma cells. GWAS, CBGDA
Associations from Text Mining
Disease Name Relationship Type References
Adenoma Associate 30175151
Arthritis Rheumatoid Associate 26191188
Bethlem myopathy Associate 27348394, 31273343
Bone Neoplasms Associate 19648885
Breast Neoplasms Associate 23939832, 38235137
Carcinoma Pancreatic Ductal Associate 34007003
Cholangiocarcinoma Associate 36694230
Colorectal Neoplasms Stimulate 22408128
Colorectal Neoplasms Associate 26208990, 32323816, 32356618
Connective Tissue Diseases Associate 27348394