Gene
Entrez ID Entrez Gene ID - the GENE ID in NCBI Gene database.
1310
Gene name Gene Name - the full gene name approved by the HGNC.
Collagen type XIX alpha 1 chain
Gene symbol Gene Symbol - the official gene symbol approved by the HGNC.
COL19A1
Synonyms (NCBI Gene) Gene synonyms aliases
COL9A1L, D6S228E
Chromosome Chromosome number
6
Chromosome location Chromosomal Location - indicates the cytogenetic location of the gene or region on the chromosome.
6q13
Summary Summary of gene provided in NCBI Entrez Gene.
This gene encodes the alpha chain of type XIX collagen, a member of the FACIT collagen family (fibril-associated collagens with interrupted helices). Although the function of this collagen is not known, other members of this collagen family are found in a
miRNA miRNA information provided by mirtarbase database.
miRTarBase ID miRNA Experiments Reference
MIRT718519 hsa-miR-581 HITS-CLIP 19536157
MIRT718518 hsa-miR-3689a-5p HITS-CLIP 19536157
MIRT718517 hsa-miR-3689b-5p HITS-CLIP 19536157
MIRT718516 hsa-miR-3689e HITS-CLIP 19536157
MIRT718515 hsa-miR-3689f HITS-CLIP 19536157
Gene ontology (GO) Gene ontology information of associated ontologies with gene provided by GO database.
GO ID Ontology Definition Evidence Reference
GO:0001501 Process Skeletal system development TAS 9143499
GO:0005201 Function Extracellular matrix structural constituent IBA 21873635
GO:0005201 Function Extracellular matrix structural constituent NAS 1639419, 7775380
GO:0005576 Component Extracellular region TAS
GO:0005581 Component Collagen trimer NAS 7775380
Other IDs Other ids provides unique ids of gene in databases such as OMIM, HGNC, ENSEMBLE.
MIM HGNC e!Ensembl
120165 2196 ENSG00000082293
Protein
UniProt ID Q14993
Protein name Collagen alpha-1(XIX) chain (Collagen alpha-1(Y) chain)
Protein function May act as a cross-bridge between fibrils and other extracellular matrix molecules. Involved in skeletal myogenesis in the developing esophagus. May play a role in organization of the pericellular matrix or the sphinteric smooth muscle. {ECO:000
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF01391 Collagen 289 350 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 331 395 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 379 441 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 452 533 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 565 628 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 625 681 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 756 821 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 838 904 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 887 957 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 947 1012 Collagen triple helix repeat (20 copies) Repeat
Tissue specificity TISSUE SPECIFICITY: Localized to vascular, neuronal, mesenchymal, and some epithelial basement membrane zones in umbilical cord. {ECO:0000269|PubMed:12788917}.
Sequence
MRLTGPWKLWLWMSIFLLPASTSVTVRDKTEESCPILRIEGHQLTYDNINKLEVSGFDLG
DSFSLRRAFCESDKTCFKLGSALLIRDTIKIFPKGLPEEYSVAAMFRVRRNAKKERWFLW
QVLNQQNIPQISIVVDGGKKVVEFMFQATEGDVLNYIFRNRELRPLFDRQWHKLGISIQS
QVISLYMDCNLIARRQTDEKDTVDFHGRTVIATRASDGKPVDIELHQLKIYCSANLIAQE
TCCEISDTKCPEQDGFGNIASSWVTAHASKMSSYLPAKQELKDQCQCIPNKGEAGLPGAP
GSPGQKGHKGEPGENGLHGAPGFPGQKGEQ
GFEGSKGETGEKGEQGEKGDPALAGLNGEN
GLKGDLGPHGPPGPKGEKGDTGPPGPPALPGSLGIQGPQGPPGKEGQRGRRGKTGPPGKP
GPPGPPGPPGIQGIHQTLGGY
YNKDNKGNDEHEAGGLKGDKGETGLPGFPGSVGPKGQKG
EPGEPFTKGEKGDRGEPGVIGSQGVKGEPGDPGPPGLIGSPGLKGQQGSAGSM
GPRGPPG
DVGLPGEHGIPGKQGIKGEKGDPGGIIGPPGLPGPKGEAGPPGKSLPGEPGLDGNPGAPG
PRGPKGERGLPGVHGSPGDIGPQG
IGIPGRTGAQGPAGEPGIQGPRGLPGLPGTPGTPGN
DGVPGRDGKPGLPGPPGDPIA
LPLLGDIGALLKNFCGNCQASVPGLKSNKGEEGGAGEPG
KYDSMARKGDIGPRGPPGIPGREGPKGSKGERGYPGIPGEKGDEGLQGIPGIPGAPGPTG
PPGLMGRTGHPGPTGAKGEKGSDGPPGKPGPPGPPGIPFNE
RNGMSSLYKIKGGVNVPSY
PGPPGPPGPKGDPGPVGEPGAMGLPGLEGFPGVKGDRGPAGPPGIA
GMSGKPGAPGPPGV
PGEP
GERGPVGDIGFPGPEGPSGKPGINGKDGIPGAQGIMGKPGDRGPKGERGDQGIPGD
RGSQGERGKPGLTGMKGAIGPMGPPGNKGSMGSPGHQGPPGSPGIPGIPADA
VSFEEIKK
YINQEVLRIFEERMAVFLSQLKLPAAMLAAQAYGRPGPPGKDGLPGPPGDPGPQGYRGQK
GERGEPGIGLPGSPGLPGTSALGLPGSPGAPGPQGPPGPSGRCNPEDCLYPVSHAHQRTG
GN
Sequence length 1142
Interactions View interactions
Pathways Pathway information has different metabolic/signaling pathways associated with genes.
  KEGG   Reactome
  Protein digestion and absorption   Collagen degradation
Collagen biosynthesis and modifying enzymes
Collagen chain trimerization
Associated diseases Disease information provided by ClinVar, GenCC, and GWAS databases.
Causal
Disease term Disease name dbSNP ID References
Breast cancer Malignant neoplasm of breast rs587776547, rs1137887, rs137853007, rs587776650, rs80359351, rs80359714, rs121917783, rs104886456, rs121964878, rs80359874, rs80357868, rs80357508, rs387906843, rs80357569, rs80358158
View all (309 more)
Parkinson disease Parkinson Disease rs33939927, rs35801418, rs34805604, rs35870237, rs34995376, rs74315355, rs28940284, rs74315356, rs74315357, rs28940285, rs730880302, rs750664040, rs74315359, rs74315360, rs45539432
View all (84 more)
25475535
Unknown
Disease term Disease name Evidence References Source
Neuroticism Neuroticism GWAS
Associations from Text Mining
Disease Name Relationship Type References
Esophageal Squamous Cell Carcinoma Associate 37005910
Rhabdomyosarcoma Associate 8034603