Gene Gene information from NCBI Gene database.
Entrez ID 1284
Gene name Collagen type IV alpha 2 chain
Gene symbol COL4A2
Synonyms (NCBI Gene)
BSVD2ICHPOREN2
Chromosome 13
Chromosome location 13q34
Summary This gene encodes one of the six subunits of type IV collagen, the major structural component of basement membranes. The C-terminal portion of the protein, known as canstatin, is an inhibitor of angiogenesis and tumor growth. Like the other members of the
SNPs SNP information provided by dbSNP.
15
SNP ID Visualize variation Clinical significance Consequence
rs12877501 G>A,C Likely-pathogenic, likely-benign Missense variant, coding sequence variant
rs62621875 C>A Risk-factor, benign, likely-benign Missense variant, coding sequence variant
rs117412802 A>G Likely-benign, risk-factor, benign Missense variant, coding sequence variant
rs201105747 G>A,T Risk-factor, likely-benign, benign Coding sequence variant, missense variant
rs387906602 G>A Pathogenic Coding sequence variant, missense variant
miRNA miRNA information provided by mirtarbase database.
275
miRTarBase ID miRNA Experiments Reference
MIRT001926 hsa-miR-29c-3p Luciferase reporter assay 18390668
MIRT001926 hsa-miR-29c-3p Luciferase reporter assay 18390668
MIRT001926 hsa-miR-29c-3p Luciferase reporter assay 18390668
MIRT001926 hsa-miR-29c-3p Luciferase reporter assay 18390668
MIRT001926 hsa-miR-29c-3p Luciferase reporter assay 18390668
Transcription factors Transcription factors information provided by TRRUST V2 database.
1
Transcription factor Regulation Reference
VHL Unknown 17700531
Gene ontology (GO) Gene Ontology (GO) annotations describing the biological processes, molecular functions, and cellular components associated with a gene.
32
GO ID Ontology Definition Evidence Reference
GO:0001525 Process Angiogenesis IEA
GO:0005201 Function Extracellular matrix structural constituent IEA
GO:0005201 Function Extracellular matrix structural constituent TAS 8317999
GO:0005515 Function Protein binding IPI 12011424
GO:0005576 Component Extracellular region IEA
Other IDs Other IDs provides unique identifiers for this gene in OMIM, HGNC, and Ensembl databases.
MIM HGNC e!Ensembl
120090 2203 ENSG00000134871
Protein Protein information from UniProt database.
UniProt ID Unique identifier for the protein in the UniProt database. Click to view detailed protein information.
P08572
Protein name Collagen alpha-2(IV) chain [Cleaved into: Canstatin]
Protein function Type IV collagen is the major structural component of glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork together with laminins, proteoglycans and entactin/nidogen.; Canstatin, a cleavage product corresponding to th
PDB 1LI1 , 5NAX , 5NB2 , 6MPX
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF01391 Collagen 56 118 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 112 173 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 182 234 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 291 353 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 423 483 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 491 552 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 679 740 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 720 778 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 777 838 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 812 892 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 862 938 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 920 981 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1032 1096 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1098 1157 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1152 1214 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1278 1340 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1332 1396 Collagen triple helix repeat (20 copies) Repeat
PF01413 C4 1490 1595 C-terminal tandem repeated domain in type 4 procollagen Domain
PF01413 C4 1598 1710 C-terminal tandem repeated domain in type 4 procollagen Domain
Sequence
MGRDQRAVAGPALRRWLLLGTVTVGFLAQSVLAGVKKFDVPCGGRDCSGGCQCYPEKGGR
GQPGPVGPQGYNGPPGLQGFPGLQGRKGDKGERGAPGVTGPKGDVGARGVS
GFPGADGIP
GHPGQGGPRGRPGYDGCNGTQGDSGPQGPPGSEGFTGPPGPQGPKGQKGEPYA
LPKEERD
RYRGEPGEPGLVGFQGPPGRPGHVGQMGPVGAPGRPGPPGPPGPKGQQGNRGLGFYGVKG
EKGDVGQPGPNGIPSDTLHPIIAPTGVTFHPDQYKGEKGSEGEPGIRGISLKGEEGIMGF
PGLRGYPGLSGEKGSPGQKGSRGLDGYQGPDGPRGPKGEAGDPGPPGLPAYSP
HPSLAKG
ARGDPGFPGAQGEPGSQGEPGDPGLPGPPGLSIGDGDQRRGLPGEMGPKGFIGDPGIPAL
YGGPPGPDGKRGPPGPPGLPGPPGPDGFLFGLKGAKGRAGFPGLPGSPGARGPKGWKGDA
GEC
RCTEGDEAIKGLPGLPGPKGFAGINGEPGRKGDRGDPGQHGLPGFPGLKGVPGNIGA
PGPKGAKGDSRT
ITTKGERGQPGVPGVPGMKGDDGSPGRDGLDGFPGLPGPPGDGIKGPP
GDPGYPGIPGTKGTPGEMGPPGLGLPGLKGQRGFPGDAGLPGPPGFLGPPGPAGTPGQID
CDTDVKRAVGGDRQEAIQPGCIGGPKGLPGLPGPPGPTGAKGLRGIPGFAGADGGPGPRG
LPGDAGREGFPGPPGFIGPR
GSKGAVGLPGPDGSPGPIGLPGPDGPPGERGLPGEVLGAQ
PGPRGDAGVPGQPGLKGLPGDRGPPGFRGSQGMPGMPGLKGQPGLPGPSGQPGLYGPPGL
HGFPGAPGQEGPLGLPGIPGREGLPGDRGDPGDTGAPGPVGMKGLSGDRGDAGFTGEQGH
PGSPGFKGIDGMPGTPGLKGDRGSPGMDGFQGMPGLKGRPGFPGSKGEAGFFGIPGLKGL
AGEPGFKGSRGDPGPPGPPPV
ILPGMKDIKGEKGDEGPMGLKGYLGAKGIQGMPGIPGLS
GIPGLPGRPGHIKGVKGDIGVPGIPGLPGFPGVAGPPGITGFPGFIGSRGDKGAPGRAGL
YGEIGATGDFGDIGDT
INLPGRPGLKGERGTTGIPGLKGFFGEKGTEGDIGFPGITGVTG
VQGPPGLKGQT
GFPGLTGPPGSQGELGRIGLPGGKGDDGWPGAPGLPGFPGLRGIRGLHG
LPGTKGFPGSPGSD
IHGDPGFPGPPGERGDPGEANTLPGPVGVPGQKGDQGAPGERGPPG
SPGLQGFPGITPPSNISGAPGDKGAPGIFGLKGYRGPPGPPGSAALPGSKGDTGNPGAPG
TPGTKGWAGDS
GPQGRPGVFGLPGEKGPRGEQGFMGNTGPTGAVGDRGPKGPKGDPGFPG
APGTVGAPGIAGIPQK
IAVQPGTVGPQGRRGPPGAPGEMGPQGPPGEPGFRGAPGKAGPQ
GRGGVSAVPGFRGDEGPIGHQGPIGQEGAPGRPGSPGLPGMPGRSVSIGYLLVKHSQTDQ
EPMCPVGMNKLWSGYSLLYFEGQEKAHNQDLGLAGSCLARFSTMPFLYCNPGDVCYYASR
NDKSYWLSTTAPLPMMPVAEDEIKPYISRCSVCEA
PAIAIAVHSQDVSIPHCPAGWRSLW
IGYSFLMHTAAGDEGGGQSLVSPGSCLEDFRATPFIECNGGRGTCHYYANKYSFWLTTIP
EQSFQGSPSADTLKAGLIRTHISRCQVCMK
NL
Sequence length 1712
Interactions View interactions
Associated diseases Disease associations from ClinVar categorized as Causal (Pathogenic/Likely Pathogenic) or Unknown.
594
Causal Diseases associated with Pathogenic or Likely Pathogenic variants in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession
Cerebral palsy Likely pathogenic rs1013146465, rs2139538791, rs2139537197 RCV001796565
RCV001796562
RCV001849844
COL4A2-related disorder Likely pathogenic rs2502050500, rs2502120134 RCV003123296
RCV004552404
Intellectual disability Likely pathogenic rs2139537197 RCV001849844
Intraventricular hemorrhage Likely pathogenic rs2139553210 RCV001391266
Unknown Diseases with uncertain, conflicting, or no pathogenic evidence in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession
Adrenocortical carcinoma, hereditary Benign; Likely benign rs2296849, rs3803230, rs147765396 RCV005893203
RCV005893196
RCV005903684
Bethlem myopathy 2 Conflicting classifications of pathogenicity rs2502163650 RCV005863876
BRAIN SMALL VESSEL DISEASE 2B, AUTOSOMAL RECESSIVE Uncertain significance rs751015466 RCV006255197
Cervical cancer Benign; Likely benign rs147765396, rs147210424 RCV005903685
RCV005904459
Associations from Text MiningDisease associations identified through Pubtator
Disease Name Relationship Type References
13q deletion syndrome Associate 28393221
Aortic Dissection Associate 32062133
Astrocytoma Associate 12937144
Atherosclerosis Associate 27389912
Blood Platelet Disorders Associate 35150448
Brain Diseases Associate 36811765
Brain Injuries Diffuse Associate 32515830
Brain Neoplasms Associate 26277786
Breast Neoplasms Associate 27891193, 31613058
Carcinoma Hepatocellular Associate 31905170