Gene Gene information from NCBI Gene database.
Entrez ID 1288
Gene name Collagen type IV alpha 6 chain
Gene symbol COL4A6
Synonyms (NCBI Gene)
CXDELq22.3DELXq22.3DFNX6
Chromosome X
Chromosome location Xq22.3
Summary This gene encodes one of the six subunits of type IV collagen, the major structural component of basement membranes. Like the other members of the type IV collagen gene family, this gene is organized in a head-to-head conformation with another type IV col
SNPs SNP information provided by dbSNP.
3
SNP ID Visualize variation Clinical significance Consequence
rs143895379 C>T Conflicting-interpretations-of-pathogenicity Missense variant, non coding transcript variant, coding sequence variant
rs769211787 A>C Likely-pathogenic Genic downstream transcript variant, terminator codon variant, stop lost
rs779748859 C>T Pathogenic Coding sequence variant, missense variant, non coding transcript variant
miRNA miRNA information provided by mirtarbase database.
29
miRTarBase ID miRNA Experiments Reference
MIRT048147 hsa-miR-197-3p CLASH 23622248
MIRT053742 hsa-miR-29b-3p Microarray 22942087
MIRT902994 hsa-miR-1252 CLIP-seq
MIRT902995 hsa-miR-1303 CLIP-seq
MIRT902996 hsa-miR-299-3p CLIP-seq
Gene ontology (GO) Gene Ontology (GO) annotations describing the biological processes, molecular functions, and cellular components associated with a gene.
21
GO ID Ontology Definition Evidence Reference
GO:0005201 Function Extracellular matrix structural constituent IEA
GO:0005515 Function Protein binding IPI 32814053
GO:0005576 Component Extracellular region IEA
GO:0005576 Component Extracellular region TAS
GO:0005581 Component Collagen trimer IEA
Other IDs Other IDs provides unique identifiers for this gene in OMIM, HGNC, and Ensembl databases.
MIM HGNC e!Ensembl
303631 2208 ENSG00000197565
Protein Protein information from UniProt database.
UniProt ID Unique identifier for the protein in the UniProt database. Click to view detailed protein information.
Q14031
Protein name Collagen alpha-6(IV) chain
Protein function Type IV collagen is the major structural component of glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork together with laminins, proteoglycans and entactin/nidogen.
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF01391 Collagen 45 106 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 101 161 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 164 222 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 357 418 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 489 550 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 659 705 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 754 815 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 795 873 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 859 921 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 894 961 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 967 1027 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1011 1076 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1077 1136 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1133 1192 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1193 1252 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1254 1315 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1310 1372 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1375 1434 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1407 1468 Collagen triple helix repeat (20 copies) Repeat
PF01413 C4 1468 1573 C-terminal tandem repeated domain in type 4 procollagen Domain
PF01413 C4 1576 1689 C-terminal tandem repeated domain in type 4 procollagen Domain
Sequence
MLINKLWLLLVTLCLTEELAAAGEKSYGKPCGGQDCSGSCQCFPEKGARGRPGPIGIQGP
TGPQGFTGSTGLSGLKGERGFPGLLGPYGPKGDKGPMGVP
GFLGINGIPGHPGQPGPRGP
PGLDGCNGTQGAVGFPGPDGYPGLLGPPGLPGQKGSKGDPV
LAPGSFKGMKGDPGLPGLD
GITGPQGAPGFPGAVGPAGPPGLQGPPGPPGPLGPDGNMGLG
FQGEKGVKGDVGLPGPAG
PPPSTGELEFMGFPKGKKGSKGEPGPKGFPGISGPPGFPGLGTTGEKGEKGEKGIPGLPG
PRGPMGSEGVQGPPGQQGKKGTLGFPGLNGFQGIEGQKGDIGLPGPDVFIDIDGAVISGN
PGDPGVPGLPGLKGDEGIQGLRGPSGVPGLPALSGVPGALGPQGFPGLKGDQGNPGRT
TI
GAAGLPGRDGLPGPPGPPGPPSPEFETETLHNKESGFPGLRGEQGPKGNLGLKGIKGDSG
FCACDGGVPNTGPPGEPGPPGPWGLIGLPGLKGARGDRGSGGAQGPAGAPGLVGPLGPSG
PKGKKGEPIL
STIQGMPGDRGDSGSQGFRGVIGEPGKDGVPGLPGLPGLPGDGGQGFPGE
KGLPGLPGEKGHPGPPGLPGNGLPGLPGPRGLPGDKGKDGLPGQQGLPGSKGITLPCIIP
GSYGPSGFPGTPGFPGPKGSRGLPGTPGQPGSSGSKGEPGSPGLV
HLPELPGFPGPRGEK
GLPGFPGLPGKDGLPGMIGSPGLPGSKGATGDIFGAENGAPGEQGLQGLTGHKGFLGDSG
LPGLKGVHGKPGLL
GPKGERGSPGTPGQVGQPGTPGSSGPYGIKGKSGLPGAPGFPGISG
HPGKKGTRGKKGPPGSIVKKGLPGLKGLPGNPGLVGLKGSPGSPGVAGLPALSGPKGEKG
SVGFVGFPGIPGLPGIPGTRG
LKGIPGSTGKMGPSGRAGTPGEKGDRGNPGPVGIPSPRR
P
MSNLWLKGDKGSQGSAGSNGFPGPRGDKGEAGRPGPPGLPGAPGLPGIIKGVSGKPGPP
GFMGIRG
LPGLKGSSGITGFPGMPGESGSQGIRGSPGLPGASGLPGLKGDNGQTVE
ISGS
PGPKGQPGESGFKGTKGRDGLIGNIGFPGNKGEDGKVGVSGDVGLPGAPGFP
GVAGMRGE
PGLPGSSGHQGAIGPLGSPGLIGPKGFPGFPGLHGLNGLPGTKGTHGTPGPS
ITGVPGPA
GLPGPKGEKGYPGIGIGAPGKPGLRGQKGDRGFPGLQGPAGLPGAPGISLPS
LIAGQPGD
PGRPGLDGERGRPGPAGPPGPPGPSSNQGDTGDPGFPGIPGPKGPKGDQ
GIPGFSGLPGE
LGLKGMRGEPGFMGTPGKVGPPGDPGFPGMKGKAGPRGSSGLQGDPGQTPTA
EAVQVPPG
PLGLPGIDGIPGLTGDPGAQGPVGLQ
GSKGLPGIPGKDGPSGLPGPPGALGDPGLPGLQG
PPGFEGAPGQQGPFGMPGMPGQSMRVGYTLVKHSQSEQVPPCPIGMSQLWVGYSLLFVEG
QEKAHNQDLGFAGSCLPRFSTMPFIYCNINEVCHYARRNDKSYWLSTTAPIPMMPVSQTQ
IPQYISRCSVCEA
PSQAIAVHSQDITIPQCPLGWRSLWIGYSFLMHTAAGAEGGGQSLVS
PGSCLEDFRATPFIECSGARGTCHYFANKYSFWLTTVEERQQFGELPVSETLKAGQLHTR
VSRCQVCMK
SL
Sequence length 1691
Interactions View interactions
Associated diseases Disease associations from ClinVar categorized as Causal (Pathogenic/Likely Pathogenic) or Unknown.
145
Causal Diseases associated with Pathogenic or Likely Pathogenic variants in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession
COL4A6-related disorder Likely pathogenic rs769211787 RCV003965228
Hearing loss, X-linked 6 Likely pathogenic rs779748859, rs2035656072 RCV000088659
RCV004585112
Unknown Diseases with uncertain, conflicting, or no pathogenic evidence in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession
Clear cell carcinoma of kidney Benign rs143146797 RCV005907229
Gastric cancer Likely benign rs142593944 RCV005926624
Lung cancer Benign rs73636380 RCV005923546
Lymphedema Benign; Likely benign rs34740537 RCV005625700
Associations from Text MiningDisease associations identified through Pubtator
Disease Name Relationship Type References
Adenocarcinoma Associate 23301059, 34252262
Carcinogenesis Associate 23301059
Carcinoma Basal Cell Associate 34104083
Colorectal Neoplasms Associate 16507901, 32945508
Fibromuscular Dysplasia Associate 30045277
Genetic Diseases Inborn Inhibit 26179878
Hearing Loss Associate 33840813
Kidney Diseases Associate 8587250
Labyrinth Diseases Associate 33840813
Leiomyoma Associate 23738515, 26787895, 9502408