Gene
Entrez ID (the gene ID in the NCBI Gene database): 1284
Gene name (the full gene name approved by the HGNC): Collagen type IV alpha 2 chain
Gene symbol (the official gene symbol approved by the HGNC): COL4A2
Synonyms (NCBI Gene): BSVD2, ICH, POREN2
Disease acronyms (UniProt): BSVD2, ICH
Chromosome: 13
Chromosome location (the cytogenetic location of the gene on the chromosome): 13q34
Summary (provided by NCBI Entrez Gene):
This gene encodes one of the six subunits of type IV collagen, the major structural component of basement membranes. The C-terminal portion of the protein, known as canstatin, is an inhibitor of angiogenesis and tumor growth. Like the other members of the
SNPs (SNP information provided by dbSNP)
SNP ID | Variation | Clinical significance | Consequence
rs12877501 | G>A,C | Likely-pathogenic, likely-benign | Missense variant, coding sequence variant
rs62621875 | C>A | Risk-factor, benign, likely-benign | Missense variant, coding sequence variant
rs117412802 | A>G | Likely-benign, risk-factor, benign | Missense variant, coding sequence variant
rs201105747 | G>A,T | Risk-factor, likely-benign, benign | Coding sequence variant, missense variant
rs387906602 | G>A | Pathogenic | Coding sequence variant, missense variant
miRNA (miRNA information provided by the miRTarBase database)
miRTarBase ID | miRNA | Experiments | Reference
MIRT001926 | hsa-miR-29c-3p | Luciferase reporter assay | 18390668
Transcription factors
Transcription factor | Regulation | Reference
VHL | Unknown | 17700531
Gene ontology (GO) (ontology terms associated with the gene, provided by the GO database)
GO ID | Ontology | Definition | Evidence | Reference
GO:0001525 | Process | Angiogenesis | IEA |
GO:0005201 | Function | Extracellular matrix structural constituent | IBA | 21873635
GO:0005201 | Function | Extracellular matrix structural constituent | TAS | 8317999
GO:0005515 | Function | Protein binding | IPI | 12011424
GO:0005576 | Component | Extracellular region | TAS |
Other IDs (unique IDs for the gene in other databases such as OMIM, HGNC, and Ensembl)
MIM | HGNC | Ensembl
120090 | 2203 | ENSG00000134871
Protein
UniProt ID P08572
Protein name Collagen alpha-2(IV) chain [Cleaved into: Canstatin]
Protein function Type IV collagen is the major structural component of glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork together with laminins, proteoglycans and entactin/nidogen.; Canstatin, a cleavage product corresponding to th
PDB 1LI1, 5NAX, 5NB2, 6MPX
Family and domains

Pfam

Accession | ID | Position in sequence | Description | Type
PF01391 | Collagen | 56-118 | Collagen triple helix repeat (20 copies) | Repeat
PF01391 | Collagen | 112-173 | Collagen triple helix repeat (20 copies) | Repeat
PF01391 | Collagen | 182-234 | Collagen triple helix repeat (20 copies) | Repeat
PF01391 | Collagen | 291-353 | Collagen triple helix repeat (20 copies) | Repeat
PF01391 | Collagen | 423-483 | Collagen triple helix repeat (20 copies) | Repeat
PF01391 | Collagen | 491-552 | Collagen triple helix repeat (20 copies) | Repeat
PF01391 | Collagen | 679-740 | Collagen triple helix repeat (20 copies) | Repeat
PF01391 | Collagen | 720-778 | Collagen triple helix repeat (20 copies) | Repeat
PF01391 | Collagen | 777-838 | Collagen triple helix repeat (20 copies) | Repeat
PF01391 | Collagen | 812-892 | Collagen triple helix repeat (20 copies) | Repeat
PF01391 | Collagen | 862-938 | Collagen triple helix repeat (20 copies) | Repeat
PF01391 | Collagen | 920-981 | Collagen triple helix repeat (20 copies) | Repeat
PF01391 | Collagen | 1032-1096 | Collagen triple helix repeat (20 copies) | Repeat
PF01391 | Collagen | 1098-1157 | Collagen triple helix repeat (20 copies) | Repeat
PF01391 | Collagen | 1152-1214 | Collagen triple helix repeat (20 copies) | Repeat
PF01391 | Collagen | 1278-1340 | Collagen triple helix repeat (20 copies) | Repeat
PF01391 | Collagen | 1332-1396 | Collagen triple helix repeat (20 copies) | Repeat
PF01413 | C4 | 1490-1595 | C-terminal tandem repeated domain in type 4 procollagen | Domain
PF01413 | C4 | 1598-1710 | C-terminal tandem repeated domain in type 4 procollagen | Domain
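The Pfam rows above give each hit as a start and an end position in the protein sequence. As a minimal sketch, assuming those positions are 1-based and inclusive (the usual convention for protein feature coordinates, but an assumption here), the size of a hit and whether two hits overlap can be computed as follows. The helper names `region_length` and `overlaps` are hypothetical and are not part of any database API.

```python
# Coordinates copied from the two PF01413 (C4) rows of the Pfam table.
C4_DOMAINS = [(1490, 1595), (1598, 1710)]


def region_length(start, end):
    """Number of residues in a 1-based, inclusive range (assumed convention)."""
    return end - start + 1


def overlaps(a, b):
    """True when two inclusive (start, end) ranges share at least one residue."""
    return a[0] <= b[1] and b[0] <= a[1]


# Lengths of the two C4 domains, in table order.
c4_lengths = [region_length(start, end) for start, end in C4_DOMAINS]

# The first two collagen repeat hits (56-118 and 112-173) share residues,
# while the two C4 domains do not.
first_repeats_overlap = overlaps((56, 118), (112, 173))
c4_domains_overlap = overlaps(C4_DOMAINS[0], C4_DOMAINS[1])
```

Under that assumption, several adjacent PF01391 hits in the table overlap by a few residues, which is why summing the hit lengths would overcount the residues they cover.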
Sequence
MGRDQRAVAGPALRRWLLLGTVTVGFLAQSVLAGVKKFDVPCGGRDCSGGCQCYPEKGGR
GQPGPVGPQGYNGPPGLQGFPGLQGRKGDKGERGAPGVTGPKGDVGARGVSGFPGADGIP
GHPGQGGPRGRPGYDGCNGTQGDSGPQGPPGSEGFTGPPGPQGPKGQKGEPYALPKEERD
RYRGEPGEPGLVGFQGPPGRPGHVGQMGPVGAPGRPGPPGPPGPKGQQGNRGLGFYGVKG
EKGDVGQPGPNGIPSDTLHPIIAPTGVTFHPDQYKGEKGSEGEPGIRGISLKGEEGIMGF
PGLRGYPGLSGEKGSPGQKGSRGLDGYQGPDGPRGPKGEAGDPGPPGLPAYSPHPSLAKG
ARGDPGFPGAQGEPGSQGEPGDPGLPGPPGLSIGDGDQRRGLPGEMGPKGFIGDPGIPAL
YGGPPGPDGKRGPPGPPGLPGPPGPDGFLFGLKGAKGRAGFPGLPGSPGARGPKGWKGDA
GECRCTEGDEAIKGLPGLPGPKGFAGINGEPGRKGDRGDPGQHGLPGFPGLKGVPGNIGA
PGPKGAKGDSRTITTKGERGQPGVPGVPGMKGDDGSPGRDGLDGFPGLPGPPGDGIKGPP
GDPGYPGIPGTKGTPGEMGPPGLGLPGLKGQRGFPGDAGLPGPPGFLGPPGPAGTPGQID
CDTDVKRAVGGDRQEAIQPGCIGGPKGLPGLPGPPGPTGAKGLRGIPGFAGADGGPGPRG
LPGDAGREGFPGPPGFIGPRGSKGAVGLPGPDGSPGPIGLPGPDGPPGERGLPGEVLGAQ
PGPRGDAGVPGQPGLKGLPGDRGPPGFRGSQGMPGMPGLKGQPGLPGPSGQPGLYGPPGL
HGFPGAPGQEGPLGLPGIPGREGLPGDRGDPGDTGAPGPVGMKGLSGDRGDAGFTGEQGH
PGSPGFKGIDGMPGTPGLKGDRGSPGMDGFQGMPGLKGRPGFPGSKGEAGFFGIPGLKGL
AGEPGFKGSRGDPGPPGPPPVILPGMKDIKGEKGDEGPMGLKGYLGAKGIQGMPGIPGLS
GIPGLPGRPGHIKGVKGDIGVPGIPGLPGFPGVAGPPGITGFPGFIGSRGDKGAPGRAGL
YGEIGATGDFGDIGDTINLPGRPGLKGERGTTGIPGLKGFFGEKGTEGDIGFPGITGVTG
VQGPPGLKGQTGFPGLTGPPGSQGELGRIGLPGGKGDDGWPGAPGLPGFPGLRGIRGLHG
LPGTKGFPGSPGSDIHGDPGFPGPPGERGDPGEANTLPGPVGVPGQKGDQGAPGERGPPG
SPGLQGFPGITPPSNISGAPGDKGAPGIFGLKGYRGPPGPPGSAALPGSKGDTGNPGAPG
TPGTKGWAGDSGPQGRPGVFGLPGEKGPRGEQGFMGNTGPTGAVGDRGPKGPKGDPGFPG
APGTVGAPGIAGIPQKIAVQPGTVGPQGRRGPPGAPGEMGPQGPPGEPGFRGAPGKAGPQ
GRGGVSAVPGFRGDEGPIGHQGPIGQEGAPGRPGSPGLPGMPGRSVSIGYLLVKHSQTDQ
EPMCPVGMNKLWSGYSLLYFEGQEKAHNQDLGLAGSCLARFSTMPFLYCNPGDVCYYASR
NDKSYWLSTTAPLPMMPVAEDEIKPYISRCSVCEAPAIAIAVHSQDVSIPHCPAGWRSLW
IGYSFLMHTAAGDEGGGQSLVSPGSCLEDFRATPFIECNGGRGTCHYYANKYSFWLTTIP
EQSFQGSPSADTLKAGLIRTHISRCQVCMKNL
Sequence length 1712
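Because the sequence is wrapped across many lines, a quick consistency check is to join the lines and compare the residue count with the reported sequence length. The sketch below is illustrative only: `join_sequence` and `matches_reported_length` are hypothetical helper names, and the short input is just the first twelve residues of the sequence split arbitrarily, not the full record.

```python
def join_sequence(lines):
    """Concatenate wrapped sequence lines, dropping surrounding whitespace."""
    return "".join(line.strip() for line in lines)


def matches_reported_length(lines, reported_length):
    """True when the joined sequence has exactly the reported residue count."""
    return len(join_sequence(lines)) == reported_length


# Toy input: the first 12 residues, wrapped unevenly with stray whitespace.
toy_lines = ["MGRDQ", " RAVAG ", "PA\n"]
toy_sequence = join_sequence(toy_lines)
toy_ok = matches_reported_length(toy_lines, 12)
```

For the full record, the same check would be run with every sequence line as input and 1712 as the reported length.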
Associated diseases (disease information provided by the ClinVar, GenCC, and GWAS databases)
Unknown
Disease term | Disease name | Evidence | References | Source
Porencephaly | familial porencephaly, porencephaly 2 | | | GenCC
Coronary artery disease | Coronary artery disease | | | GWAS
Myocardial Infarction | Myocardial Infarction | | | GWAS
Coronary Heart Disease | Coronary Heart Disease | | | GWAS
Associations from Text Mining
Disease Name | Relationship Type | References
13q deletion syndrome | Associate | 28393221
Aortic Dissection | Associate | 32062133
Astrocytoma | Associate | 12937144
Atherosclerosis | Associate | 27389912
Blood Platelet Disorders | Associate | 35150448
Brain Diseases | Associate | 36811765
Brain Injuries Diffuse | Associate | 32515830
Brain Neoplasms | Associate | 26277786
Breast Neoplasms | Associate | 27891193, 31613058
Carcinoma Hepatocellular | Associate | 31905170