Gene
Entrez ID Entrez Gene ID - the GENE ID in NCBI Gene database.
169044
Gene name Gene Name - the full gene name approved by the HGNC.
Collagen type XXII alpha 1 chain
Gene symbol Gene Symbol - the official gene symbol approved by the HGNC.
COL22A1
Synonyms (NCBI Gene) Gene synonyms aliases
-
Chromosome Chromosome number
8
Chromosome location Chromosomal Location - indicates the cytogenetic location of the gene or region on the chromosome.
8q24.23-q24.3
Summary Summary of gene provided in NCBI Entrez Gene.
This gene encodes member of the collagen family which is thought to contribute to the stabilization of myotendinous junctions and strengthen skeletal muscle attachments during contractile activity. It belongs to the fibril-associated collagens with interr
SNPs SNP information provided by dbSNP.
SNP ID Visualize variation Clinical significance Consequence
rs6988229 C>T Drug-response Genic upstream transcript variant, intron variant
miRNA miRNA information provided by mirtarbase database.
miRTarBase ID miRNA Experiments Reference
MIRT630881 hsa-miR-6500-3p HITS-CLIP 23824327
MIRT630880 hsa-miR-1193 HITS-CLIP 23824327
MIRT630879 hsa-miR-197-3p HITS-CLIP 23824327
MIRT630878 hsa-miR-4436b-5p HITS-CLIP 23824327
MIRT630877 hsa-miR-432-3p HITS-CLIP 23824327
Gene ontology (GO) Gene ontology information of associated ontologies with gene provided by GO database.
GO ID Ontology Definition Evidence Reference
GO:0001525 Process Angiogenesis IBA 21873635
GO:0001886 Process Endothelial cell morphogenesis IBA 21873635
GO:0005201 Function Extracellular matrix structural constituent IBA 21873635
GO:0005576 Component Extracellular region TAS
GO:0005581 Component Collagen trimer IEA
Other IDs Other ids provides unique ids of gene in databases such as OMIM, HGNC, ENSEMBLE.
MIM HGNC e!Ensembl
610026 22989 ENSG00000169436
Protein
UniProt ID Q8NFW1
Protein name Collagen alpha-1(XXII) chain
Protein function Acts as a cell adhesion ligand for skin epithelial cells and fibroblasts.
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF00092 VWA 38 212 von Willebrand factor type A domain Domain
PF01391 Collagen 522 600 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 562 630 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 695 773 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 744 813 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 798 856 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 861 926 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1043 1100 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1116 1174 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1156 1226 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1249 1317 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1297 1364 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1401 1461 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1494 1553 Collagen triple helix repeat (20 copies) Repeat
Tissue specificity TISSUE SPECIFICITY: Restrictive expression is observed at tissue junctions such as the myotendinous junction in skeletal and heart muscle, the articular cartilage-synovial fluid junction, or the border between the anagen hair follicle and the dermis in th
Sequence
MAGLRGNAVAGLLWMLLLWSGGGGCQAQRAGCKSVHYDLVFLLDTSSSVGKEDFEKVRQW
VANLVDTFEVGPDRTRVGVVRYSDRPTTAFELGLFGSQEEVKAAARRLAYHGGNTNTGDA
LRYITARSFSPHAGGRPRDRAYKQVAILLTDGRSQDLVLDAAAAAHRAGIRIFAVGVGEA
LKEELEEIASEPKSAHVFHVSDFNAIDKIRGK
LRRRLCENVLCPSVRVEGDRFKHTNGGT
KEITGFDLMDLFSVKEILGKRENGAQSSYVRMGSFPVVQSTEDVFPQGLPDEYAFVTTFR
FRKTSRKEDWYIWQVIDQYSIPQVSIRLDGENKAVEYNAVGAMKDAVRVVFRGSRVNDLF
DRDWHKMALSIQAQNVSLHIDCALVQTLPIEERENIDIQGKTVIGKRLYDSVPIDFDLQR
IVIYCDSRHAELETCCDIPSGPCQVTVVTEPPPPPPPQRPPTPGSEQIGFLKTINCSCPA
GEKGEMGVAGPMGLPGPKGDIGAIGPVGAPGPKGEKGDVGIGPFGQGEKGEKGSLGLPGP
PGRDGSKGMRGEPGELGEPGL
PGEVGMRGPQGPPGLPGPPGRVGAPGLQGERGEKGTRGE
KGERGLDGFPGKPGDTGQQGRPGPSGVAGP
QGEKGDVGPAGPPGVPGSVVQQEGLKGEQG
APGPRGHQGAPGPPGARGPIGPEGRDGPPGLQGLRGKKGDMGPPGIPGLLGLQGPPGPPG
VPGPPGPGGSPGLPGEIGFPGKP
GPPGPTGPPGKDGPNGPPGPPGTKGEPGERGEDGLPG
KPGLRGEIGEQGLAGRPGEKGEAGLPGAPGFPGVRGEKGDQGEKGELGLPGLKGDRGEKG
EAGPAGPPGLPGTTSL
FTPHPRMPGEQGPKGEKGDPGLPGEPGLQGRPGELGPQGPTGPP
GAKGQEGAHGAPGAAGNPGAPGHVGA
PGPSGPPGSVGAPGLRGTPGKDGERGEKGAAGEE
GSPGPVGPRGDPGAPGLPGPPGKGKDGEPGLRGSPGLPGPLGTKAACGKVRGSENCALGG
QCVKGDRGAPGIPGSPGSRGDPGIGVAGPPGPSGPPGDKGSPGSRGLPGFPGPQGPAGRD
GAPGNPGERGPPGKPGLSSL
LSPGDINLLAKDVCNDCPPGPPGLPGLPGFKGDKGVPGKP
GREGTEGKKGEAGPP
GLPGPPGIAGPQGSQGERGADGEVGQKGDQGHPGVPGFMGPPGNP
GPPGADGIAGAAGPPGIQGSPGKEGP
PGPQGPSGLPGIPGEEGKEGRDGKPGPPGEPGKA
GEPGLPGPEGARGPPGFKGHTGDSGAPGPRGESGAM
GLPGQEGLPGKDGDTGPTGPQGPQ
GPRGPPGKNGSPGSPGEPGPSGTPGQKGSKGENGSPGLPGFLGP
RGPPGEPGEKGVPGKE
GVPGKPGEPGFKGERGDPGIKGDKGPPGGKGQPGDPGIPGHKGHTGLMGPQGLPGENGPV
GPPGPPGQPGFPGLRGESPSM
ETLRRLIQEELGKQLETRLAYLLAQMPPAYMKSSQGRPG
PPGPPGKDGLPGRAGPMGEPGRPGQGGLEGPSGPIGPKGERGAKGDPGAPGVG
LRGEMGP
PGIPGQPGEPGYAKDGLPGIPGPQGETGPAGHPGLPGPPGPPGQCDPSQCAYFASLAARP
GNVKGP
Sequence length 1626
Interactions View interactions
Pathways Pathway information has different metabolic/signaling pathways associated with genes.
  KEGG   Reactome
  Protein digestion and absorption   Collagen biosynthesis and modifying enzymes
Collagen chain trimerization
Associated diseases Disease information provided by ClinVar, GenCC, and GWAS databases.
Causal
Disease term Disease name dbSNP ID References
Leukemia Leukemia, Myelocytic, Acute rs121909646, rs121913488, rs587776834, rs752746786, rs869312821, rs767454740, rs1554564297 27903959
Lung carcinoma Small cell carcinoma of lung rs1805076, rs121909071, rs121913530, rs112445441, rs121913529, rs121913535, rs121913297, rs121913279, rs104886003, rs397516975, rs11554290, rs121913364, rs121913351, rs121913369, rs121913355
View all (44 more)
22941189
Unknown
Disease term Disease name Evidence References Source
Mental depression Major Depressive Disorder 31123309 ClinVar
Asthma Asthma GWAS
Retinal Detachment Retinal Detachment GWAS
Cervical Cancer Cervical Cancer Our screens identified 10 miRNAs that enhance fitness of HeLa cells and have been reported to be up-regulated in cervical cancer (Table2). GWAS, CBGDA
Associations from Text Mining
Disease Name Relationship Type References
Carcinoma Squamous Cell Associate 32596344
Cardiomyopathy Dilated Associate 38460409
Fibrosis Associate 30678304
Gastrointestinal Stromal Tumors Associate 25239601, 28130400
Glaucoma Associate 31816047
Head and Neck Neoplasms Associate 34635145
Macular Degeneration Associate 31816047
Migraine Disorders Associate 28627999
Neoplasm Metastasis Associate 36189307
Osteosarcoma Associate 36189307