Gene
Entrez ID Entrez Gene ID - the GENE ID in NCBI Gene database.
1286
Gene name Gene Name - the full gene name approved by the HGNC.
Collagen type IV alpha 4 chain
Gene symbol Gene Symbol - the official gene symbol approved by the HGNC.
COL4A4
Synonyms (NCBI Gene) Gene synonyms aliases
ATS2, BFH, BFH1, CA44
Disease Acronyms (UniProt) Disease acronyms from UniProt database
ATS2, BFH1
Chromosome Chromosome number
2
Chromosome location Chromosomal Location - indicates the cytogenetic location of the gene or region on the chromosome.
2q36.3
Summary Summary of gene provided in NCBI Entrez Gene.
This gene encodes one of the six subunits of type IV collagen, the major structural component of basement membranes. This particular collagen IV subunit, however, is only found in a subset of basement membranes. Like the other members of the type IV colla
SNPs SNP information provided by dbSNP.
SNP ID Visualize variation Clinical significance Consequence
rs35138315 G>C Pathogenic Coding sequence variant, stop gained, non coding transcript variant
rs55978207 C>A,T Likely-benign, conflicting-interpretations-of-pathogenicity Synonymous variant, coding sequence variant, non coding transcript variant
rs114969026 T>C Conflicting-interpretations-of-pathogenicity, benign Non coding transcript variant, genic upstream transcript variant, coding sequence variant, missense variant
rs121912858 C>T Pathogenic Non coding transcript variant, coding sequence variant, missense variant
rs121912859 G>T Pathogenic Non coding transcript variant, stop gained, coding sequence variant
miRNA miRNA information provided by mirtarbase database.
miRTarBase ID miRNA Experiments Reference
MIRT022562 hsa-miR-124-3p Microarray 18668037
MIRT029341 hsa-miR-26b-5p Microarray 19088304
MIRT437823 hsa-miR-150-5p Luciferase reporter assay 23604143
MIRT659421 hsa-miR-548c-3p HITS-CLIP 23824327
MIRT659420 hsa-miR-216b-5p HITS-CLIP 23824327
Transcription factors
Transcription factor Regulation Reference
LMX1B Activation 11175791
Gene ontology (GO) Gene ontology information of associated ontologies with gene provided by GO database.
GO ID Ontology Definition Evidence Reference
GO:0005201 Function Extracellular matrix structural constituent IBA 21873635
GO:0005201 Function Extracellular matrix structural constituent IMP 17942953, 19675380
GO:0005576 Component Extracellular region TAS
GO:0005587 Component Collagen type IV trimer IBA 21873635
GO:0005587 Component Collagen type IV trimer IDA 7523402
Other IDs Other ids provides unique ids of gene in databases such as OMIM, HGNC, ENSEMBLE.
MIM HGNC e!Ensembl
120131 2206 ENSG00000081052
Protein
UniProt ID P53420
Protein name Collagen alpha-4(IV) chain
Protein function Type IV collagen is the major structural component of glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork together with laminins, proteoglycans and entactin/nidogen.
PDB 5NB1 , 6WKU
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF01391 Collagen 62 121 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 118 176 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 181 238 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 296 360 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 365 428 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 400 457 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 494 559 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 906 967 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 969 1029 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1015 1080 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1082 1141 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1133 1195 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1196 1253 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1313 1372 Collagen triple helix repeat (20 copies) Repeat
PF01413 C4 1466 1571 C-terminal tandem repeated domain in type 4 procollagen Domain
PF01413 C4 1574 1688 C-terminal tandem repeated domain in type 4 procollagen Domain
Tissue specificity TISSUE SPECIFICITY: Expressed in Bruch's membrane, outer plexiform layer, inner nuclear layer, inner plexiform layer, ganglion cell layer, inner limiting membrane and around the blood vessels of the retina (at protein level) (PubMed:29777959). Alpha 3 and
Sequence
MWSLHIVLMRCSFRLTKSLATGPWSLILILFSVQYVYGSGKKYIGPCGGRDCSVCHCVPE
KGSRGPPGPPGPQGPIGPLGAPGPIGLSGEKGMRGDRGPPGAAGDKGDKGPTGVPGFPGL
D
GIPGHPGPPGPRGKPGMSGHNGSRGDPGFPGGRGALGPGGPLGHPGEKGEKGNSV
FILG
AVKGIQGDRGDPGLPGLPGSWGAGGPAGPTGYPGEPGLVGPPGQPGRPGLKGNPGVGVKG
QMGDPGEVGQQGSPGPTLLVEPPDFCLYKGEKGIKGIPGMVGLPGPPGRKGESGIGAKGE
KGIPGFPGPRGDPGSYGSPGFPGLKGELGLVGDPGLFGLIGPKGDPGNRGHPGPPGVLVT

PPLPLKGPPGDPGFPGRYGETGDVGPPGPPGLLGRPGEACAGMIGPPGPQGFPGLPGLPG
EAGIPGRP
DSAPGKPGKPGSPGLPGAPGLQGLPGSSV
IYCSVGNPGPQGIKGKVGPPGGR
GPKGEKGNEGLCACEPGPMGPPGPPGLPGRQGSKGDLGLPGWLGTKGDPGPPGAEGPPGL
PGKHGASGPPGNKGAKGDM
VVSRVKGHKGERGPDGPPGFPGQPGSHGRDGHAGEKGDPGP
PGDHEDATPGGKGFPGPLGPPGKAGPVGPPGLGFPGPPGERGHPGVPGHPGVRGPDGLKG
QKGDTISCNVTYPGRHGPPGFDGPPGPKGFPGPQGAPGLSGSDGHKGRPGTPGTAEIPGP
PGFRGDMGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPGDPAFGHLGPPGKRGLSGVPG
IKGPRGDPGCPGAEGPAGIPGFLGLKGPKGREGHAGFPGVPGPPGHSCERGAPGIPGQPG
LPGYPGSPGAPGGKGQPGDVGPPGPAGMKGLPGLPGRPGAHGPPGLPGIPGPFGDDGLPG
PPGPKGPRGLPGFPGFPGERGKPGAEGCPGAKGEPGEKGMSGLPGDRGLRGAKGAIGPPG
DEGEMAI
ISQKGTPGEPGPPGDDGFPGERGDKGTPGMQGRRGEPGRYGPPGFHRGEPGEK
GQPGPPGPP
GPPGSTGLRGFIGFPGLPGDQGEPGSPGPPGFSGIDGARGPKGNKGDPASH

FGPPGPKGEPGSPGCPGHFGASGEQGLPGIQGPRGSPGRPGPPGSSGPPGCPGDHGMPGL
R
GQPGEMGDPGPRGLQGDPGIPGPPGIKGPSGSPGLNGLHGLKGQKGTKGASGLH
DVGPP
GPVGIPGLKGERGDPGSPGISPPGPRGKKGPPGPPGSSGPPGPAGATGRAPKD
IPDPGPP
GDQGPPGPDGPRGAPGPPGLPGSVDLLRGEPGDCGLPGPPGPPGPPGPPGYKGFPGCDGK
DGQKGPVGFPGPQGPHGFPGPPGEKGLPGPPGRKGPTGLPGPRGEPGPPADV
DDCPRIPG
LPGAPGMRGPEGAMGLPGMRGPSGPGCKGEPGLDGRRGVDGVPGSPGPPGRKGDTGEDGY
PGGPGPPGPIGDPGPKGFGPGYLGGFLLVLHSQTDQEPTCPLGMPRLWTGYSLLYLEGQE
KAHNQDLGLAGSCLPVFSTLPFAYCNIHQVCHYAQRNDRSYWLASAAPLPMMPLSEEAIR
PYVSRCAVCEA
PAQAVAVHSQDQSIPPCPQTWRSLWIGYSFLMHTGAGDQGGGQALMSPG
SCLEDFRAAPFLECQGRQGTCHFFANKYSFWLTTVKADLQFSSAPAPDTLKESQAQRQKI
SRCQVCVK
YS
Sequence length 1690
Interactions View interactions
Associated diseases Disease information provided by ClinVar, GenCC, and GWAS databases.
Unknown
Disease term Disease name Evidence References Source
Hematuria hematuria, benign familial, 1 GenCC
Alport Syndrome Alport syndrome GenCC
Coronary artery disease Coronary artery disease GWAS
Myocardial Infarction Myocardial Infarction GWAS
Associations from Text Mining
Disease Name Relationship Type References
Albuminuria Associate 37872228
Alzheimer Disease Associate 11044206
Androgen Insensitivity Syndrome Associate 11044206
Anti Glomerular Basement Membrane Disease Associate 31686460, 36130833
Carcinoma Renal Cell Associate 31746384, 35529267
Central Serous Chorioretinopathy Associate 35052395
Chronic Kidney Disease Mineral and Bone Disorder Associate 31085678
Collagen Diseases Associate 27469977, 33159707
Endometrial Neoplasms Associate 36581816
Epilepsy Benign Neonatal Associate 19129241