Gene Gene information from NCBI Gene database.
Entrez ID 1282
Gene name Collagen type IV alpha 1 chain
Gene symbol COL4A1
Synonyms (NCBI Gene)
BSVDBSVD1COL4A1sPADMALRATOR
Chromosome 13
Chromosome location 13q34
Summary This gene encodes a type IV collagen alpha protein. Type IV collagen proteins are integral components of basement membranes. This gene shares a bidirectional promoter with a paralogous gene on the opposite strand. The protein consists of an amino-terminal
SNPs SNP information provided by dbSNP.
73
SNP ID Visualize variation Clinical significance Consequence
rs75711155 A>C Likely-benign, conflicting-interpretations-of-pathogenicity Intron variant
rs113994104 C>A,T Pathogenic Missense variant, coding sequence variant
rs113994105 C>T Pathogenic Genic downstream transcript variant, missense variant, coding sequence variant
rs113994106 C>T Pathogenic Genic downstream transcript variant, missense variant, coding sequence variant
rs113994107 C>T Pathogenic Genic downstream transcript variant, missense variant, coding sequence variant
miRNA miRNA information provided by mirtarbase database.
468
miRTarBase ID miRNA Experiments Reference
MIRT001927 hsa-miR-29c-3p Luciferase reporter assay 18390668
MIRT001927 hsa-miR-29c-3p Luciferase reporter assay 18390668
MIRT003221 hsa-miR-29a-3p Luciferase reporter assayqRT-PCRWestern blot 20067797
MIRT003221 hsa-miR-29a-3p Luciferase reporter assayqRT-PCRWestern blot 20067797
MIRT003221 hsa-miR-29a-3p Luciferase reporter assayqRT-PCRWestern blot 20067797
Transcription factors Transcription factors information provided by TRRUST V2 database.
1
Transcription factor Regulation Reference
LMX1B Unknown 12602071
Gene ontology (GO) Gene Ontology (GO) annotations describing the biological processes, molecular functions, and cellular components associated with a gene.
33
GO ID Ontology Definition Evidence Reference
GO:0001525 Process Angiogenesis IEA
GO:0001569 Process Branching involved in blood vessel morphogenesis IMP 20818663
GO:0005201 Function Extracellular matrix structural constituent IEA
GO:0005201 Function Extracellular matrix structural constituent IMP 20818663
GO:0005515 Function Protein binding IPI 12011424
Other IDs Other IDs provides unique identifiers for this gene in OMIM, HGNC, and Ensembl databases.
MIM HGNC e!Ensembl
120130 2202 ENSG00000187498
Protein Protein information from UniProt database.
UniProt ID Unique identifier for the protein in the UniProt database. Click to view detailed protein information.
P02462
Protein name Collagen alpha-1(IV) chain [Cleaved into: Arresten]
Protein function Type IV collagen is the major structural component of glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork together with laminins, proteoglycans and entactin/nidogen. ; Arresten, compris
PDB 1LI1 , 5NAX , 5NAY , 6MPX
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF01391 Collagen 37 105 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 61 138 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 97 162 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 167 225 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 273 334 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 474 533 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 539 592 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 643 690 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 689 737 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 736 801 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 837 896 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 882 941 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 943 1005 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 999 1058 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1057 1117 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1384 1443 Collagen triple helix repeat (20 copies) Repeat
PF01413 C4 1446 1553 C-terminal tandem repeated domain in type 4 procollagen Domain
PF01413 C4 1556 1667 C-terminal tandem repeated domain in type 4 procollagen Domain
Tissue specificity TISSUE SPECIFICITY: Highly expressed in placenta. {ECO:0000269|PubMed:10811134, ECO:0000269|PubMed:16481288}.
Sequence
MGPRLSVWLLLLPAALLLHEEHSRAAAKGGCAGSGCGKCDCHGVKGQKGERGLPGLQGVI
GFPGMQGPEGPQGPPGQKGDTGEPGLPGTKGTRGPPGASGYPGNPGLPGIPGQDGPPGPP
GIPGCNGTKGERGPLGPP
GLPGFAGNPGPPGLPGMKGDPGEI
LGHVPGMLLKGERGFPGI
PGTPGPPGLPGLQGPVGPPGFTGPPGPPGPPGPPGEKGQMGLSFQ
GPKGDKGDQGVSGPP
GVPGQAQVQEKGDFATKGEKGQKGEPGFQGMPGVGEKGEPGKPGPRGKPGKDGDKGEKGS
PGFPGEPGYPGLIGRQGPQGEKGEAGPPGPPGIV
IGTGPLGEKGERGYPGTPGPRGEPGP
KGFPGLPGQPGPPGLPVPGQAGAPGFPGERGEKGDRGFPGTSLPGPSGRDGLPGPPGSPG
PPGQPGYTNGIVECQPGPPGDQGPPGIPGQPGFIGEIGEKGQKGESCLICDIDGYRGPPG
PQGPPGEIGFPGQPGAKGDRGLPGRDGVAGVPGPQGTPGLIGQPGAKGEPGEF
YFDLRLK
GDKGDPGFPGQPGMPGRAGSPGRDGHPGLPGPKGSPGSVGLKGERGPPGGVG
FPGSRGDT
GPPGPPGYGPAGPIGDKGQAGFPGGPGSPGLPGPKGEPGKIVPLPGPPGAEGLPGSPGFP
GPQGDRGFPGTPGRPGLPGEKGAVGQPG
IGFPGPPGPKGVDGLPGDMGPPGTPGRPGFNG
LPGNPGVQGQKGEPGVGLPGLKGLPGLPGIPGTPGEKGSIGVPGVPGEHGAIGPPGLQGI
RGEPGPPGLPGSVGSPGVPGI
GPPGARGPPGGQGPPGLSGPPGIKGEKGFPGFPGLDMPG
PKGDKGAQGLPGITGQSGLPGLPGQQGAPGIPGFPGSKGEM
GVMGTPGQPGSPGPVGAPG
LPGEKGDHGFPGSSGPRGDPGLKGDKGDVGLPGKPGSMDKV
DMGSMKGQKGDQGEKGQIG
PIGEKGSRGDPGTPGVPGKDGQAGQPGQPGPKGDPGIS
GTPGAPGLPGPKGSVGGMGLPG
TPGEKGVPGIPGPQGSPGLPGDKGAKGEKGQAGPPGIGIPGLRGEKGDQGIAGFPGSPGE
KGEKGSIGIPGMPGSPGLKGSPGSVGYPGSPGLPGEK
GDKGLPGLDGIPGVKGEAGLPGT
PGPTGPAGQKGEPGSDGIPGSAGEKGEPGLPGRGFPGFPGAKGDKGSKGEVGFPGLAGSP
GIPGSKGEQGFMGPPGPQGQPGLPGSPGHATEGPKGDRGPQGQPGLPGLPGPMGPPGLPG
IDGVKGDKGNPGWPGAPGVPGPKGDPGFQGMPGIGGSPGITGSKGDMGPPGVPGFQGPKG
LPGLQGIKGDQGDQGVPGAKGLPGPPGPPGPYDIIKGEPGLPGPEGPPGLKGLQGLPGPK
GQQGVTGLVGIPGPPGIPGFDGAPGQKGEMGPAGPTGPRGFPGPPGPDGLPGSMGPPGTP
SVD
HGFLVTRHSQTIDDPQCPSGTKILYHGYSLLYVQGNERAHGQDLGTAGSCLRKFSTM
PFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPITGENIRPFISRCAVCEA
PAMVMAV
HSQTIQIPPCPSGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRG
TCNYYANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMR
RT
Sequence length 1669
Interactions View interactions
Associated diseases Disease associations from ClinVar categorized as Causal (Pathogenic/Likely Pathogenic) or Unknown.
2107
Causal Diseases associated with Pathogenic or Likely Pathogenic variants in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession
Abnormal cerebral cortex morphology Likely pathogenic; Pathogenic rs766209938 RCV001391264
Abnormal cerebral morphology Pathogenic rs1566378788 RCV002275899
Abnormal corpus callosum morphology Pathogenic rs587780588 RCV001391278
Autosomal dominant familial hematuria-retinal arteriolar tortuosity-contractures syndrome Likely pathogenic; Pathogenic rs375477517, rs2139146298, rs1594535661, rs2139179173, rs2139146179, rs1877827904, rs1436175370, rs2139163150, rs672601346, rs2501749650, rs2501792333, rs2501800825, rs780492563, rs1879237220, rs112314120
View all (26 more)
RCV002246414
RCV002246469
RCV005005287
RCV001823797
RCV001837730
RCV002484524
RCV005006307
RCV002273284
RCV000989163
RCV002468826
RCV002468828
RCV002470464
RCV005010958
RCV005002854
RCV003142254
RCV003153199
RCV003322721
RCV005003655
RCV005013014
RCV003984909
RCV003991094
RCV003991357
RCV003991372
RCV000018961
RCV000018962
RCV000018963
RCV000018966
RCV000018967
RCV000018968
RCV004545953
RCV004555327
RCV002502498
RCV002248658
RCV003322607
RCV000989164
RCV000995720
RCV001281220
RCV001281219
RCV001281218
RCV001281217
RCV001251513
Unknown Diseases with uncertain, conflicting, or no pathogenic evidence in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession
Adrenocortical carcinoma, hereditary Benign; Likely benign rs150857429 RCV005891959
Anterior segment dysgenesis Conflicting classifications of pathogenicity rs878853070 RCV001200033
Cervical cancer Benign; Likely benign rs77875306, rs150857429, rs140722653, rs372969177 RCV005919685
RCV005891960
RCV005893175
RCV005906599
Chronic kidney disease Conflicting classifications of pathogenicity rs34004222 RCV001171323
Associations from Text MiningDisease associations identified through Pubtator
Disease Name Relationship Type References
13q deletion syndrome Associate 28393221
Adenocarcinoma Associate 33905434, 36452120
Alveolar capillary dysplasia Associate 38029923
Alzheimer Disease Associate 34551193, 37264161
Anemia Hemolytic Associate 24861536
Angiopathy Hereditary With Nephropathy Aneurysms And Muscle Cramps Associate 18160688, 19949034, 22522439, 25873210, 27190376, 29395486
Arthritis Rheumatoid Associate 29482039
Astrocytoma Associate 23737970
Atherosclerosis Associate 29482039
Autistic Disorder Associate 28846756, 32499604