Gene Gene information from NCBI Gene database.
Entrez ID 1287
Gene name Collagen type IV alpha 5 chain
Gene symbol COL4A5
Synonyms (NCBI Gene)
ASLNATSATS1CA54
Chromosome X
Chromosome location Xq22.3
Summary This gene encodes one of the six subunits of type IV collagen, the major structural component of basement membranes. Mutations in this gene are associated with X-linked Alport syndrome, also known as hereditary nephritis. Like the other members of the typ
SNPs SNP information provided by dbSNP.
774
SNP ID Visualize variation Clinical significance Consequence
rs5973838 C>A,G,T Pathogenic Intron variant
rs104886042 G>- Pathogenic Coding sequence variant, frameshift variant, genic upstream transcript variant
rs104886043 G>A Pathogenic Coding sequence variant, missense variant, genic upstream transcript variant
rs104886044 G>- Pathogenic Coding sequence variant, frameshift variant, genic upstream transcript variant
rs104886045 C>- Pathogenic Coding sequence variant, frameshift variant, genic upstream transcript variant
miRNA miRNA information provided by mirtarbase database.
124
miRTarBase ID miRNA Experiments Reference
MIRT018772 hsa-miR-335-5p Microarray 18185580
MIRT030043 hsa-miR-26b-5p Microarray 19088304
MIRT053738 hsa-miR-29b-3p Microarray 22942087
MIRT450890 hsa-miR-199a-3p PAR-CLIP 22100165
MIRT450889 hsa-miR-199b-3p PAR-CLIP 22100165
Gene ontology (GO) Gene Ontology (GO) annotations describing the biological processes, molecular functions, and cellular components associated with a gene.
19
GO ID Ontology Definition Evidence Reference
GO:0005201 Function Extracellular matrix structural constituent IEA
GO:0005515 Function Protein binding IPI 32296183, 32814053
GO:0005576 Component Extracellular region IEA
GO:0005576 Component Extracellular region TAS
GO:0005581 Component Collagen trimer IEA
Other IDs Other IDs provides unique identifiers for this gene in OMIM, HGNC, and Ensembl databases.
MIM HGNC e!Ensembl
303630 2207 ENSG00000188153
Protein Protein information from UniProt database.
UniProt ID Unique identifier for the protein in the UniProt database. Click to view detailed protein information.
P29400
Protein name Collagen alpha-5(IV) chain
Protein function Type IV collagen is the major structural component of glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork together with laminins, proteoglycans and entactin/nidogen.
PDB 5NAZ , 6WKU
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF01391 Collagen 33 118 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 81 164 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 165 223 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 283 352 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 391 449 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 491 550 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 598 659 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 659 706 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 706 766 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 754 818 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 796 854 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 854 909 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 896 958 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 960 1019 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1013 1072 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1074 1133 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1128 1189 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1190 1248 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1246 1315 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1400 1460 Collagen triple helix repeat (20 copies) Repeat
PF01413 C4 1462 1569 C-terminal tandem repeated domain in type 4 procollagen Domain
PF01413 C4 1572 1683 C-terminal tandem repeated domain in type 4 procollagen Domain
Tissue specificity TISSUE SPECIFICITY: Isoform 2 is found in kidney.
Sequence
MKLRGVSLAAGLFLLALSLWGQPAEAAACYGCSPGSKCDCSGIKGEKGERGFPGLEGHPG
LPGFPGPEGPPGPRGQKGDD
GIPGPPGPKGIRGPPGLPGFPGTPGLPGMPGHDGAPGPQG
IPGCNGTKGERGFPGSPGFPGLQGPPGPPGIPGMKGEPGSIIMS
SLPGPKGNPGYPGPPG
IQGLPGPTGIPGPIGPPGPPGLMGPPGPPGLPGPKGNMGLNFQ
GPKGEKGEQGLQGPPGP
PGQISEQKRPIDVEFQKGDQGLPGDRGPPGPPGIRGPPGPPGGEKGEKGEQGEPGKRGKP
GKDGENGQPGIPGLPGDPGYPGEPGRDGEKGQKGDTGPPGPPGLVIPRPGTG
ITIGEKGN
IGLPGLPGEKGERGFPGIQGPPGLPGPPGAAVMGPPGPPGFPGERGQKGDEGPPGISIPG
PPGLDGQPGAPGLPGPPGPAGPHIPPSDE
ICEPGPPGPPGSPGDKGLQGEQGVKGDKGDT
CFNCIGTGISGPPGQPGLPGLPGPPGSLGFPGQKGEKGQAGATGPKGLPGIPGAPGAPGF
PGSKGEPGDI
LTFPGMKGDKGELGSPGAPGLPGLPGTPGQDGLPGLPGPKGEPGGITFKG
ERGPPGNPGLPGLPGNIGPMGPPGFGPPGPVGEKGIQGVAGNPGQPGIPGPKGDPGQT
IT
QPGKPGLPGNPGRDGDVGLPGDPGLPGQPGLPGIPGSKGEPGIPGIGLPGPPGPKGFPGI
PGPPGAPGTPGRIGLEGPPGPPGFPGPKGEPGFALPGPPGPPGLPGFKGALGPKGDRGFP
GPPGPPGRTGLDGLPGPKGDVGPNGQPGPMGPPGLPGIGVQGPPGPPGIPGPIGQPGLHG
IPGEKGDPGPPGLDVPGPPGERGSPGIPGAPGPIGPPGSPGLPGKAGASGFPGTKGEMGM
MGPPGPPGP
LGIPGRSGVPGLKGDDGLQGQPGLPGPTGEKGSKGEPGLPGPPGPMDPN
LL
GSKGEKGEPGLPGIPGVSGPKGYQGLPGDPGQPGLSGQPGLPGPPGPKGNPG
LPGQPGLI
GPPGLKGTIGDMGFPGPQGVEGPPGPSGVPGQPGSPGLPGQKGDKGDPGISS
IGLPGLPG
PKGEPGLPGYPGNPGIKGSVGDPGLPGLPGTPGAKGQPGLPGFPGTP
GPPGPKGISGPPG
NPGLPGEPGPVGGGGHPGQPGPPGEKGKPGQDGIPGPAGQKGEPGQPGF
GNPGPPGLPGL
SGQKGDGGLPGIPGNPGLPGPKGEPGFHGFPGVQGPPGPPGSPGP
ALEGPKGNPGPQGPP
GRPGLPGPEGPPGLPGNGGIKGEKGNPGQPGLPGLPGLKGDQGPPGLQGNPGRPG
LNGMK
GDPGLPGVPGFPGMKGPSGVPGSAGPEGEPGLIGPPGPPGLPGPSGQSIIIKGDAGPPGI
PGQPGLKGLPGPQGPQGLPGPTGPPGDPGRNGLPGFDGAGGRKGDPGLPGQPGTRGLDGP
PGPDGLQGPPGPPGTSSVAH
GFLITRHSQTTDAPQCPQGTLQVYEGFSLLYVQGNKRAHG
QDLGTAGSCLRRFSTMPFMFCNINNVCNFASRNDYSYWLSTPEPMPMSMQPLKGQSIQPF
ISRCAVCEA
PAVVIAVHSQTIQIPHCPQGWDSLWIGYSFMMHTSAGAEGSGQALASPGSC
LEEFRSAPFIECHGRGTCNYYANSYSFWLATVDVSDMFSKPQSETLKAGDLRTRISRCQV
CMK
RT
Sequence length 1685
Interactions View interactions
Associated diseases Disease associations from ClinVar (causal & non-causal) and other databases (OMIM, Orphanet, GWAS, etc.).
55
Evidence Score: ★☆☆☆☆  Gene-disease association found in Text Mining only ★★☆☆☆  Found in Text Mining and Unknown/Other Associations ★★★☆☆  Reported in Unknown/Other Associations across ≥2 Sources ★★★★☆  ClinVar: Pathogenic/Likely Pathogenic (<5 Variants) ★★★★★  ClinVar: Pathogenic/Likely Pathogenic (≥5 Variants)
Causal Diseases associated with Pathogenic or Likely Pathogenic variants in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession Evidence Score
Alport syndrome Likely pathogenic; Pathogenic rs2147975020, rs1353480777, rs2147777495, rs2147959392, rs104886299, rs104886303, rs2524632147, rs2524632703, rs2524305677, rs104886440, rs104886071, rs104886079, rs104886101, rs281874670, rs281874673
View all (33 more)
RCV002471103
RCV001795866
RCV001795869
RCV001795871
RCV001849648
View all (43 more)
★★★★★
ClinVar: Pathogenic / Likely Pathogenic (≥5 Variants)
Alport syndrome 1 Pathogenic rs104886245 RCV005627439
★★★★☆
ClinVar: Pathogenic / Likely Pathogenic (<5 Variants)
Atypical hemolytic-uremic syndrome Likely pathogenic; Pathogenic rs104886189 RCV001328145
★★★★☆
ClinVar: Pathogenic / Likely Pathogenic (<5 Variants)
Autosomal dominant Alport syndrome Likely pathogenic; Pathogenic rs2147849728, rs104886312, rs104886189 RCV001849658
RCV001849660
RCV001849275
★★★★☆
ClinVar: Pathogenic / Likely Pathogenic (<5 Variants)
Unknown / Other Associations ClinVar entries with uncertain/conflicting evidence, and associations from other databases (OMIM, Orphanet, GWAS, etc.) where the gene is not established as causal.
Phenotype Name Clinical Significance Source Evidence Score
22Q13.3 DELETION SYNDROME Disgenet
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
Acute myeloid leukemia Benign; Likely benign ClinVar
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
ATYPICAL HEMOLYTIC UREMIC SYNDROME Disgenet
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
CHRONIC KIDNEY DISEASES Disgenet
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
Associations from Text Mining Disease associations identified through text mining
Disease Name Disease (Merged) Source PMID Relationship Type Evidence Score
Adenocarcinoma Adenocarcinoma LHGDN 15211113
★☆☆☆☆
Found in Text Mining only
Adenocarcinoma Adenocarcinoma BEFREE 23564952, 25833149, 28593420, 28616268, 28940943
★☆☆☆☆
Found in Text Mining only
Adenocarcinoma of lung (disorder) Lung adenocarcinoma BEFREE 23076555, 23486266, 24423157, 25029118, 25236981, 25550848, 25833149, 26787602, 28432617, 28940943, 29228652, 29368062, 29952952, 30352093, 30833014
★☆☆☆☆
Found in Text Mining only
Adrenoleukodystrophy Adrenoleukodystrophy Pubtator 24854265 Associate
★☆☆☆☆
Found in Text Mining only
Alport Syndrome Alport Syndrome BEFREE 10049589, 10094548, 10200983, 10499074, 10684360, 10752524, 10862091, 11044206, 11223851, 11773466, 12105244, 12203217, 12436246, 12478350, 12796257
View all (106 more)
★★★★★
ClinVar: Pathogenic / Likely Pathogenic (≥5 Variants)
Alport Syndrome Alport Syndrome CLINGEN_DG 10563487, 14993485, 15153557, 16301374, 23085274, 25572247, 27725546, 27796712, 28542346, 30582011
★★★★★
ClinVar: Pathogenic / Likely Pathogenic (≥5 Variants)
Alport Syndrome Alport Syndrome GENOMICS_ENGLAND_DG 14514738, 29270492
★★★★★
ClinVar: Pathogenic / Likely Pathogenic (≥5 Variants)
Alport Syndrome Alport Syndrome CLINVAR_DG
★★★★★
ClinVar: Pathogenic / Likely Pathogenic (≥5 Variants)
ALPORT SYNDROME 1, X-LINKED Alport Syndrome UNIPROT_DG 10094548, 10561141, 10563487, 10684360, 10862091, 11004279, 11223851, 1352287, 1363780, 1376965, 1672282, 24522658, 7599631, 7853788, 8406498
View all (7 more)
★☆☆☆☆
Found in Text Mining only
ALPORT SYNDROME 1, X-LINKED Alport Syndrome CLINVAR_DG 10094548, 10561141, 10684360, 10862091, 10955921, 11223851, 11462238, 12105244, 12436246, 15780079, 15954103, 1598909, 16941480, 17396119, 17660027
View all (34 more)
★☆☆☆☆
Found in Text Mining only