Gene
Entrez ID Entrez Gene ID - the GENE ID in NCBI Gene database.
1287
Gene name Gene Name - the full gene name approved by the HGNC.
Collagen type IV alpha 5 chain
Gene symbol Gene Symbol - the official gene symbol approved by the HGNC.
COL4A5
Synonyms (NCBI Gene) Gene synonyms aliases
ASLN, ATS, ATS1, CA54
Chromosome Chromosome number
X
Chromosome location Chromosomal Location - indicates the cytogenetic location of the gene or region on the chromosome.
Xq22.3
Summary Summary of gene provided in NCBI Entrez Gene.
This gene encodes one of the six subunits of type IV collagen, the major structural component of basement membranes. Mutations in this gene are associated with X-linked Alport syndrome, also known as hereditary nephritis. Like the other members of the typ
SNPs SNP information provided by dbSNP.
SNP ID Visualize variation Clinical significance Consequence
rs5973838 C>A,G,T Pathogenic Intron variant
rs104886042 G>- Pathogenic Coding sequence variant, frameshift variant, genic upstream transcript variant
rs104886043 G>A Pathogenic Coding sequence variant, missense variant, genic upstream transcript variant
rs104886044 G>- Pathogenic Coding sequence variant, frameshift variant, genic upstream transcript variant
rs104886045 C>- Pathogenic Coding sequence variant, frameshift variant, genic upstream transcript variant
miRNA miRNA information provided by mirtarbase database.
miRTarBase ID miRNA Experiments Reference
MIRT018772 hsa-miR-335-5p Microarray 18185580
MIRT030043 hsa-miR-26b-5p Microarray 19088304
MIRT053738 hsa-miR-29b-3p Microarray 22942087
MIRT450890 hsa-miR-199a-3p PAR-CLIP 22100165
MIRT450889 hsa-miR-199b-3p PAR-CLIP 22100165
Gene ontology (GO) Gene ontology information of associated ontologies with gene provided by GO database.
GO ID Ontology Definition Evidence Reference
GO:0005201 Function Extracellular matrix structural constituent IEA
GO:0005515 Function Protein binding IPI 32296183, 32814053
GO:0005576 Component Extracellular region IEA
GO:0005576 Component Extracellular region TAS
GO:0005581 Component Collagen trimer IEA
Other IDs Other ids provides unique ids of gene in databases such as OMIM, HGNC, ENSEMBLE.
MIM HGNC e!Ensembl
303630 2207 ENSG00000188153
Protein
UniProt ID P29400
Protein name Collagen alpha-5(IV) chain
Protein function Type IV collagen is the major structural component of glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork together with laminins, proteoglycans and entactin/nidogen.
PDB 5NAZ , 6WKU
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF01391 Collagen 33 118 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 81 164 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 165 223 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 283 352 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 391 449 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 491 550 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 598 659 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 659 706 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 706 766 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 754 818 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 796 854 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 854 909 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 896 958 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 960 1019 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1013 1072 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1074 1133 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1128 1189 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1190 1248 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1246 1315 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1400 1460 Collagen triple helix repeat (20 copies) Repeat
PF01413 C4 1462 1569 C-terminal tandem repeated domain in type 4 procollagen Domain
PF01413 C4 1572 1683 C-terminal tandem repeated domain in type 4 procollagen Domain
Tissue specificity TISSUE SPECIFICITY: Isoform 2 is found in kidney.
Sequence
MKLRGVSLAAGLFLLALSLWGQPAEAAACYGCSPGSKCDCSGIKGEKGERGFPGLEGHPG
LPGFPGPEGPPGPRGQKGDD
GIPGPPGPKGIRGPPGLPGFPGTPGLPGMPGHDGAPGPQG
IPGCNGTKGERGFPGSPGFPGLQGPPGPPGIPGMKGEPGSIIMS
SLPGPKGNPGYPGPPG
IQGLPGPTGIPGPIGPPGPPGLMGPPGPPGLPGPKGNMGLNFQ
GPKGEKGEQGLQGPPGP
PGQISEQKRPIDVEFQKGDQGLPGDRGPPGPPGIRGPPGPPGGEKGEKGEQGEPGKRGKP
GKDGENGQPGIPGLPGDPGYPGEPGRDGEKGQKGDTGPPGPPGLVIPRPGTG
ITIGEKGN
IGLPGLPGEKGERGFPGIQGPPGLPGPPGAAVMGPPGPPGFPGERGQKGDEGPPGISIPG
PPGLDGQPGAPGLPGPPGPAGPHIPPSDE
ICEPGPPGPPGSPGDKGLQGEQGVKGDKGDT
CFNCIGTGISGPPGQPGLPGLPGPPGSLGFPGQKGEKGQAGATGPKGLPGIPGAPGAPGF
PGSKGEPGDI
LTFPGMKGDKGELGSPGAPGLPGLPGTPGQDGLPGLPGPKGEPGGITFKG
ERGPPGNPGLPGLPGNIGPMGPPGFGPPGPVGEKGIQGVAGNPGQPGIPGPKGDPGQT
IT
QPGKPGLPGNPGRDGDVGLPGDPGLPGQPGLPGIPGSKGEPGIPGIGLPGPPGPKGFPGI
PGPPGAPGTPGRIGLEGPPGPPGFPGPKGEPGFALPGPPGPPGLPGFKGALGPKGDRGFP
GPPGPPGRTGLDGLPGPKGDVGPNGQPGPMGPPGLPGIGVQGPPGPPGIPGPIGQPGLHG
IPGEKGDPGPPGLDVPGPPGERGSPGIPGAPGPIGPPGSPGLPGKAGASGFPGTKGEMGM
MGPPGPPGP
LGIPGRSGVPGLKGDDGLQGQPGLPGPTGEKGSKGEPGLPGPPGPMDPN
LL
GSKGEKGEPGLPGIPGVSGPKGYQGLPGDPGQPGLSGQPGLPGPPGPKGNPG
LPGQPGLI
GPPGLKGTIGDMGFPGPQGVEGPPGPSGVPGQPGSPGLPGQKGDKGDPGISS
IGLPGLPG
PKGEPGLPGYPGNPGIKGSVGDPGLPGLPGTPGAKGQPGLPGFPGTP
GPPGPKGISGPPG
NPGLPGEPGPVGGGGHPGQPGPPGEKGKPGQDGIPGPAGQKGEPGQPGF
GNPGPPGLPGL
SGQKGDGGLPGIPGNPGLPGPKGEPGFHGFPGVQGPPGPPGSPGP
ALEGPKGNPGPQGPP
GRPGLPGPEGPPGLPGNGGIKGEKGNPGQPGLPGLPGLKGDQGPPGLQGNPGRPG
LNGMK
GDPGLPGVPGFPGMKGPSGVPGSAGPEGEPGLIGPPGPPGLPGPSGQSIIIKGDAGPPGI
PGQPGLKGLPGPQGPQGLPGPTGPPGDPGRNGLPGFDGAGGRKGDPGLPGQPGTRGLDGP
PGPDGLQGPPGPPGTSSVAH
GFLITRHSQTTDAPQCPQGTLQVYEGFSLLYVQGNKRAHG
QDLGTAGSCLRRFSTMPFMFCNINNVCNFASRNDYSYWLSTPEPMPMSMQPLKGQSIQPF
ISRCAVCEA
PAVVIAVHSQTIQIPHCPQGWDSLWIGYSFMMHTSAGAEGSGQALASPGSC
LEEFRSAPFIECHGRGTCNYYANSYSFWLATVDVSDMFSKPQSETLKAGDLRTRISRCQV
CMK
RT
Sequence length 1685
Interactions View interactions
Pathways Pathway information has different metabolic/signaling pathways associated with genes.
  KEGG   Reactome
  PI3K-Akt signaling pathway
Focal adhesion
ECM-receptor interaction
Cytoskeleton in muscle cells
Relaxin signaling pathway
AGE-RAGE signaling pathway in diabetic complications
Protein digestion and absorption
Amoebiasis
Human papillomavirus infection
Pathways in cancer
Small cell lung cancer
  Collagen degradation
Extracellular matrix organization
Collagen biosynthesis and modifying enzymes
Signaling by PDGF
Assembly of collagen fibrils and other multimeric structures
Integrin cell surface interactions
Anchoring fibril formation
Crosslinking of collagen fibrils
Laminin interactions
Non-integrin membrane-ECM interactions
NCAM1 interactions
Collagen chain trimerization
<
Associated diseases Disease associations categorized as Causal (pathogenic variants), Unknown (uncertain genetic evidence), or Text Mining (literature-based associations)
Causal Diseases caused by PATHOGENIC or LIKELY PATHOGENIC variants from ClinVar only
Disease merge term Disease name dbSNP ID References
Alport Syndrome alport syndrome, autosomal dominant alport syndrome rs104886071, rs104886189, rs1569492951, rs281874670, rs104886247, rs104886142, rs104886101, rs104886079, rs281874753, rs1569494378, rs1603290681, rs104886286, rs281874673, rs1556420349, rs104886440
View all (3 more)
N/A
Alport Syndrome, X-Linked x-linked alport syndrome rs104886174, rs2066231013, rs1569491399, rs281874667, rs281874702, rs1603293553, rs104886059, rs104886288, rs1603290169, rs104886189, rs797045035, rs1603290131, rs104886388, rs104886372, rs104886130
View all (248 more)
N/A
focal segmental glomerulosclerosis Focal segmental glomerulosclerosis rs2066568818 N/A
hearing impairment Hearing impairment rs281874743 N/A
Associations from Text Mining Disease associations identified through Pubtator
Disease Name Relationship Type References
Adrenoleukodystrophy Associate 24854265
Anterior Spinal Artery Syndrome Associate 10321542, 25110662
Anti Glomerular Basement Membrane Disease Associate 31686460, 36130833, 8196274, 8264140
Atrophy Associate 38139322
Carcinoma Basal Cell Associate 34104083
Carcinoma Renal Cell Associate 33838279
Cardiomyopathy Dilated Associate 27936202
Chromosome Deletion Associate 9598718
Chronic Kidney Disease Mineral and Bone Disorder Associate 31085678, 36128480
Corneal Dystrophy Fleck Associate 25110662