Gene Gene information from NCBI Gene database.
Entrez ID 80781
Gene name Collagen type XVIII alpha 1 chain
Gene symbol COL18A1
Synonyms (NCBI Gene)
GLCCKNOKNO1KS
Chromosome 21
Chromosome location 21q22.3
Summary This gene encodes the alpha chain of type XVIII collagen. This collagen is one of the multiplexins, extracellular matrix proteins that contain multiple triple-helix domains (collagenous domains) interrupted by non-collagenous domains. A long isoform of th
SNPs SNP information provided by dbSNP.
9
SNP ID Visualize variation Clinical significance Consequence
rs749009747 C>-,CC Pathogenic, likely-pathogenic Frameshift variant, coding sequence variant
rs775168204 C>-,CC Pathogenic Frameshift variant, coding sequence variant
rs778612831 A>G Likely-pathogenic Splice acceptor variant
rs778909108 C>-,CC Likely-pathogenic Frameshift variant, coding sequence variant
rs786205554 C>G,T Likely-pathogenic Stop gained, missense variant, coding sequence variant
miRNA miRNA information provided by mirtarbase database.
186
miRTarBase ID miRNA Experiments Reference
MIRT049203 hsa-miR-92a-3p CLASH 23622248
MIRT037044 hsa-miR-877-3p CLASH 23622248
MIRT527343 hsa-miR-4453 PAR-CLIP 22012620
MIRT527342 hsa-miR-4538 PAR-CLIP 22012620
MIRT527341 hsa-miR-6852-5p PAR-CLIP 22012620
Transcription factors Transcription factors information provided by TRRUST V2 database.
3
Transcription factor Regulation Reference
FOXA2 Unknown 11093745
NFIC Unknown 11093745
SP1 Unknown 11093745
Gene ontology (GO) Gene Ontology (GO) annotations describing the biological processes, molecular functions, and cellular components associated with a gene.
31
GO ID Ontology Definition Evidence Reference
GO:0001501 Process Skeletal system development IBA
GO:0001525 Process Angiogenesis IBA
GO:0001886 Process Endothelial cell morphogenesis IBA
GO:0005515 Function Protein binding IPI 17615292, 19542224, 19877579, 24117177
GO:0005576 Component Extracellular region HDA 27068509
Other IDs Other IDs provides unique identifiers for this gene in OMIM, HGNC, and Ensembl databases.
MIM HGNC e!Ensembl
120328 2195 ENSG00000182871
Protein Protein information from UniProt database.
UniProt ID Unique identifier for the protein in the UniProt database. Click to view detailed protein information.
P39060
Protein name Collagen alpha-1(XVIII) chain [Cleaved into: Endostatin; Non-collagenous domain 1 (NC1)]
Protein function Probably plays a major role in determining the retinal structure as well as in the closure of the neural tube. ; [Non-collagenous domain 1]: May regulate extracellular matrix-dependent motility and morphog
PDB 1BNL , 3HON , 3HSH
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF06121 DUF959 13 194 Domain of Unknown Function (DUF959) Domain
PF01392 Fz 334 440 Fz domain Domain
PF01391 Collagen 791 857 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 927 990 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1038 1096 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1070 1128 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1186 1246 Collagen triple helix repeat (20 copies) Repeat
PF06482 Endostatin 1443 1750 Collagenase NC10 and Endostatin Domain
Tissue specificity TISSUE SPECIFICITY: Detected in placenta (at protein level) (PubMed:32337544). Present in multiple organs with highest levels in liver, lung and kidney. {ECO:0000269|PubMed:32337544}.
Sequence
MAPYPCGCHILLLLFCCLAAARANLLNLNWLWFNNEDTSHAATTIPEPQGPLPVQPTADT
TTHVTPRNGSTEPATAPGSPEPPSELLEDGQDTPTSAESPDAPEENIAGVGAEILNVAKG
IRSFVQLWNDTVPTESLARAETLVLETPVGPLALAGPSSTPQENGTTLWPSRGIPSSPGA
HTTEAGTLPAPTPS
PPSLGRPWAPLTGPSVPPPSSGRASLSSLLGGAPPWGSLQDPDSQG
LSPAAAAPSQQLQRPDVRLRTPLLHPLVMGSLGKHAAPSAFSSGLPGALSQVAVTTLTRD
SGAWVSHVANSVGPGLANNSALLGADPEAPAGRCLPLPPSLPVCGHLGISRFWLPNHLHH
ESGEQVRAGARAWGGLLQTHCHPFLAWFFCLLLVPPCGSVPPPAPPPCCQFCEALQDACW
SRLGGGRLPVACASLPTQED
GYCVLIGPAAERISEEVGLLQLLGDPPPQQVTQTDDPDVG
LAYVFGPDANSGQVARYHFPSLFFRDFSLLFHIRPATEGPGVLFAITDSAQAMVLLGVKL
SGVQDGHQDISLLYTEPGAGQTHTAASFRLPAFVGQWTHLALSVAGGFVALYVDCEEFQR
MPLARSSRGLELEPGAGLFVAQAGGADPDKFQGVIAELKVRRDPQVSPMHCLDEEGDDSD
GASGDSGSGLGDARELLREETGAALKPRLPAPPPVTTPPLAGGSSTEDSRSEEVEEQTTV
ASLGAQTLPGSDSVSTWDGSVRTPGGRVKEGGLKGQKGEPGVPGPPGRAGPPGSPCLPGP
PGLPCPVSPLGPAGPALQTVPGPQGPPGPPGRDGTPGRDGEPGDPGEDGKPGDTGPQGFP
GTPGDVGPKGDKGDPGV
GERGPPGPQGPPGPPGPSFRHDKLTFIDMEGSGFGGDLEALRG
PRGFPGPPGPPGVPGLPGEPGRFGVNSSDVPGPAGLPGVPGREGPPGFPGLPGPPGPPGR
EGPPGRTGQKGSLGEAGAPGHKGSKGAPGP
AGARGESGLAGAPGPAGPPGPPGPPGPPGP
GLPAGFDDMEGSGGPFWSTARSADGPQGPPGLPGLKGDPGVPGLPGAKGEVGADGVPGFP
GLPGREGIAGPQGPKG
DRGSRGEKGDPGKDGVGQPGLPGPPGPPGPVV
YVSEQDGSVLSV
PGPEGRPGFAGFPGPAGPKGNLGSKGERGSPGPKGEKGEPGSIFSPDGGALGPAQKGAKG
EPGFRGPPGPYGRPGYKGEIGFPGRPGRPGMNGLKGEKGEPGDASL
GFGMRGMPGPPGPP
GPPGPPGTPVYDSNVFAESSRPGPPGLPGNQGPPGPKGAKGEVGPPGPPGQFPFDFLQLE
AEMKGEKGDRGDAGQKGERGEPGGGGFFGSSLPGPPGPPGPPGPRGYPGIPGPKGESIRG
QPGPPGPQGPPGIGYEGRQGPPGPPGPPGPPSFPGPHRQTISVPGPPGPPGPPGPPGTMG
ASSGVRLWATRQAMLGQVHEVPEGWLIFVAEQEELYVRVQNGFRKVQLEARTPLPRGTDN
EVAALQPPVVQLHDSNPYPRREHPHPTARPWRADDILASPPRLPEPQPYPGAPHHSSYVH
LRPARPTSPPAHSHRDFQPVLHLVALNSPLSGGMRGIRGADFQCFQQARAVGLAGTFRAF
LSSRLQDLYSIVRRADRAAVPIVNLKDELLFPSWEALFSGSEGPLKPGARIFSFDGKDVL
RHPTWPQKSVWHGSDPNGRRLTESYCETWRTEAPSATGQASSLLGGRLLGQSAASCHHAY
IVLCIENSFM
TASK
Sequence length 1754
Interactions View interactions
Pathways Pathway information has different metabolic/signaling pathways associated with genes.
  KEGG  Reactome
  Protein digestion and absorption   Collagen degradation
Activation of Matrix Metalloproteinases
Collagen biosynthesis and modifying enzymes
Assembly of collagen fibrils and other multimeric structures
Integrin cell surface interactions
Laminin interactions
Collagen chain trimerization
Associated diseases Disease associations from ClinVar categorized as Causal (Pathogenic/Likely Pathogenic) or Unknown.
657
Causal Diseases associated with Pathogenic or Likely Pathogenic variants in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession
Cataract Pathogenic rs765919785, rs1057518802 RCV000414765
RCV000415261
Clear cell carcinoma of kidney Likely pathogenic rs776147216 RCV005927020
COL18A1-related disorder Pathogenic rs749541171, rs775168204 RCV004548977
RCV003335083
Cowden syndrome 1 Pathogenic rs2037488871 RCV005861222
Unknown Diseases with uncertain, conflicting, or no pathogenic evidence in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession
Acute myeloid leukemia Benign rs9977637 RCV005923686
Cervical cancer Uncertain significance rs200779516 RCV005897503
Colon adenocarcinoma Uncertain significance; Benign; Likely benign rs375628545, rs200779516, rs78227997 RCV005897490
RCV005897502
RCV005870677
EBV-positive nodal T- and NK-cell lymphoma Likely benign rs946925859 RCV004560199
Associations from Text MiningDisease associations identified through Pubtator
Disease Name Relationship Type References
AA amyloidosis Associate 12408231
Adenoma Associate 21274745
Adrenal Insufficiency Associate 32178553
Adrenocortical Carcinoma Associate 22358232
Alzheimer Disease Associate 12408231, 33633844
Anemia Sickle Cell Associate 38353026
Aortic Aneurysm Abdominal Associate 29656960
Arthritis Rheumatoid Associate 22450926
Atrophy Associate 32178553
Breast Neoplasms Associate 16552441, 17587451