GediPNet

COL4A2 (collagen type IV alpha 2 chain)

Gene
Entrez ID: the gene ID in the NCBI Gene database.
1284
Gene name: the full gene name approved by the HGNC.
Collagen type IV alpha 2 chain
Gene symbol: the official gene symbol approved by the HGNC, a short abbreviated form of the gene name.
COL4A2
Synonyms: gene synonyms and aliases.
BSVD2, ICH, POREN2
Chromosome: chromosome number.
13
Chromosome location: the cytogenetic location of the gene or region on the chromosome.
13q34
Summary: gene summary provided by NCBI Entrez Gene.
This gene encodes one of the six subunits of type IV collagen, the major structural component of basement membranes. The C-terminal portion of the protein, known as canstatin, is an inhibitor of angiogenesis and tumor growth. Like the other members of the type IV collagen gene family, this gene is organized in a head-to-head conformation with another type IV collagen gene so that each gene pair shares a common promoter. [provided by RefSeq, Jul 2008]
SNPs: SNP information provided by dbSNP.
SNP ID Visualize variation Clinical significance Consequence
rs12877501 G>A,C Likely-pathogenic, likely-benign Missense variant, coding sequence variant
rs62621875 C>A Risk-factor, benign, likely-benign Missense variant, coding sequence variant
rs117412802 A>G Likely-benign, risk-factor, benign Missense variant, coding sequence variant
rs201105747 G>A,T Risk-factor, likely-benign, benign Coding sequence variant, missense variant
rs387906602 G>A Pathogenic Coding sequence variant, missense variant
miRNA: miRNA information provided by the miRTarBase database.
miRTarBase ID miRNA Experiments Reference
MIRT000099 hsa-miR-29b-3p Luciferase reporter assay 22745231
MIRT000099 hsa-miR-29b-3p qRT-PCR 25744716
MIRT001926 hsa-miR-29c-3p Luciferase reporter assay, Reporter assay;Other 18390668
MIRT001926 hsa-miR-29c-3p Luciferase reporter assay 22745231
MIRT003220 hsa-miR-29a-3p Luciferase reporter assay, qRT-PCR, Western blot 20067797
Transcription factors
Transcription factor Regulation Reference
VHL Unknown 17700531
Gene ontology (GO): ontology terms associated with the gene, provided by the GO database.
GO ID Ontology Definition Evidence Reference
GO:0001525 Process Angiogenesis IEA
GO:0005201 Function Extracellular matrix structural constituent IBA 21873635
GO:0005201 Function Extracellular matrix structural constituent TAS 8317999
GO:0005515 Function Protein binding IPI 12011424
GO:0005576 Component Extracellular region TAS
Other IDs: unique identifiers of the gene in databases such as OMIM, HGNC, and Ensembl.
MIM
HGNC
Ensembl
Protein
UniProt ID P08572
Protein name Collagen alpha-2(IV) chain [Cleaved into: Canstatin]
Protein function Type IV collagen is the major structural component of glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork together with laminins, proteoglycans and entactin/nidogen. Canstatin, a cleavage product corresponding to the collagen alpha 2(IV) NC1 domain, possesses both anti-angiogenic and anti-tumor cell activity. It inhibits proliferation and migration of endothelial cells, reduces mitochondrial membrane potential, and induces apoptosis. Specifically, it induces Fas-dependent apoptosis and activates procaspase-8 and -9. It is a ligand for alphavbeta3 and alphavbeta5 integrins.
PDB 1LI1, 5NAX, 5NB2, 6MPX
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF01391 Collagen 56-118 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 112-173 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 182-234 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 291-353 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 423-483 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 491-552 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 679-740 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 720-778 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 777-838 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 812-892 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 862-938 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 920-981 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1032-1096 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1098-1157 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1152-1214 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1278-1340 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1332-1396 Collagen triple helix repeat (20 copies) Repeat
PF01413 C4 1490-1595 C-terminal tandem repeated domain in type 4 procollagen Domain
PF01413 C4 1598-1710 C-terminal tandem repeated domain in type 4 procollagen Domain
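The Pfam positions listed above can be checked with a little arithmetic. The following is a minimal sketch, not part of GediPNet, that computes the length of each hit and flags neighbouring hits whose ranges overlap. It assumes the positions are 1-based and inclusive; the names `PFAM_HITS`, `span_length`, and `overlapping_pairs` are hypothetical and chosen only for illustration.

```python
# Pfam hits copied from the table above as (accession, start, end).
# Assumption: positions are 1-based and inclusive.
PFAM_HITS = [
    ("PF01391", 56, 118), ("PF01391", 112, 173), ("PF01391", 182, 234),
    ("PF01391", 291, 353), ("PF01391", 423, 483), ("PF01391", 491, 552),
    ("PF01391", 679, 740), ("PF01391", 720, 778), ("PF01391", 777, 838),
    ("PF01391", 812, 892), ("PF01391", 862, 938), ("PF01391", 920, 981),
    ("PF01391", 1032, 1096), ("PF01391", 1098, 1157), ("PF01391", 1152, 1214),
    ("PF01391", 1278, 1340), ("PF01391", 1332, 1396),
    ("PF01413", 1490, 1595), ("PF01413", 1598, 1710),
]


def span_length(start, end):
    """Number of residues covered by an inclusive [start, end] range."""
    return end - start + 1


def overlapping_pairs(hits):
    """Return neighbouring hits (sorted by start) whose ranges overlap."""
    ordered = sorted(hits, key=lambda hit: hit[1])
    return [
        (left, right)
        for left, right in zip(ordered, ordered[1:])
        if right[1] <= left[2]  # next hit starts before the previous one ends
    ]


if __name__ == "__main__":
    for accession, start, end in PFAM_HITS:
        print(accession, start, end, span_length(start, end))
    for left, right in overlapping_pairs(PFAM_HITS):
        print("overlap:", left, right)
```

Under the inclusive-position assumption, the two C4 domains span 106 and 113 residues, and several of the collagen repeat hits overlap their neighbours, which is expected when a repeat model matches at staggered offsets.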
Sequence
MGRDQRAVAGPALRRWLLLGTVTVGFLAQSVLAGVKKFDVPCGGRDCSGGCQCYPEKGGR
GQPGPVGPQGYNGPPGLQGFPGLQGRKGDKGERGAPGVTGPKGDVGARGVSGFPGADGIP
GHPGQGGPRGRPGYDGCNGTQGDSGPQGPPGSEGFTGPPGPQGPKGQKGEPYALPKEERD
RYRGEPGEPGLVGFQGPPGRPGHVGQMGPVGAPGRPGPPGPPGPKGQQGNRGLGFYGVKG
EKGDVGQPGPNGIPSDTLHPIIAPTGVTFHPDQYKGEKGSEGEPGIRGISLKGEEGIMGF
PGLRGYPGLSGEKGSPGQKGSRGLDGYQGPDGPRGPKGEAGDPGPPGLPAYSPHPSLAKG
ARGDPGFPGAQGEPGSQGEPGDPGLPGPPGLSIGDGDQRRGLPGEMGPKGFIGDPGIPAL
YGGPPGPDGKRGPPGPPGLPGPPGPDGFLFGLKGAKGRAGFPGLPGSPGARGPKGWKGDA
GECRCTEGDEAIKGLPGLPGPKGFAGINGEPGRKGDRGDPGQHGLPGFPGLKGVPGNIGA
PGPKGAKGDSRTITTKGERGQPGVPGVPGMKGDDGSPGRDGLDGFPGLPGPPGDGIKGPP
GDPGYPGIPGTKGTPGEMGPPGLGLPGLKGQRGFPGDAGLPGPPGFLGPPGPAGTPGQID
CDTDVKRAVGGDRQEAIQPGCIGGPKGLPGLPGPPGPTGAKGLRGIPGFAGADGGPGPRG
LPGDAGREGFPGPPGFIGPRGSKGAVGLPGPDGSPGPIGLPGPDGPPGERGLPGEVLGAQ
PGPRGDAGVPGQPGLKGLPGDRGPPGFRGSQGMPGMPGLKGQPGLPGPSGQPGLYGPPGL
HGFPGAPGQEGPLGLPGIPGREGLPGDRGDPGDTGAPGPVGMKGLSGDRGDAGFTGEQGH
PGSPGFKGIDGMPGTPGLKGDRGSPGMDGFQGMPGLKGRPGFPGSKGEAGFFGIPGLKGL
AGEPGFKGSRGDPGPPGPPPVILPGMKDIKGEKGDEGPMGLKGYLGAKGIQGMPGIPGLS
GIPGLPGRPGHIKGVKGDIGVPGIPGLPGFPGVAGPPGITGFPGFIGSRGDKGAPGRAGL
YGEIGATGDFGDIGDTINLPGRPGLKGERGTTGIPGLKGFFGEKGTEGDIGFPGITGVTG
VQGPPGLKGQTGFPGLTGPPGSQGELGRIGLPGGKGDDGWPGAPGLPGFPGLRGIRGLHG
LPGTKGFPGSPGSDIHGDPGFPGPPGERGDPGEANTLPGPVGVPGQKGDQGAPGERGPPG
SPGLQGFPGITPPSNISGAPGDKGAPGIFGLKGYRGPPGPPGSAALPGSKGDTGNPGAPG
TPGTKGWAGDSGPQGRPGVFGLPGEKGPRGEQGFMGNTGPTGAVGDRGPKGPKGDPGFPG
APGTVGAPGIAGIPQKIAVQPGTVGPQGRRGPPGAPGEMGPQGPPGEPGFRGAPGKAGPQ
GRGGVSAVPGFRGDEGPIGHQGPIGQEGAPGRPGSPGLPGMPGRSVSIGYLLVKHSQTDQ
EPMCPVGMNKLWSGYSLLYFEGQEKAHNQDLGLAGSCLARFSTMPFLYCNPGDVCYYASR
NDKSYWLSTTAPLPMMPVAEDEIKPYISRCSVCEAPAIAIAVHSQDVSIPHCPAGWRSLW
IGYSFLMHTAAGDEGGGQSLVSPGSCLEDFRATPFIECNGGRGTCHYYANKYSFWLTTIP
EQSFQGSPSADTLKAGLIRTHISRCQVCMKNL
Sequence length 1712
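As a quick consistency check on the stated length, here is a minimal sketch of how a pasted sequence could be re-wrapped at a fixed width and counted. The 60-residue line width is an assumption inferred from the unbroken sequence lines, and `wrap_sequence` is a hypothetical helper, not a GediPNet or UniProt function.

```python
def wrap_sequence(sequence, width=60):
    """Strip all whitespace from a pasted sequence and re-wrap it at `width`.

    Assumption: 60 residues per line, as the unbroken lines above suggest.
    """
    cleaned = "".join(sequence.split())
    return [cleaned[i:i + width] for i in range(0, len(cleaned), width)]


# A 1712-residue sequence at 60 residues per line gives 28 full lines
# plus a final line of 32 residues.
full_lines, remainder = divmod(1712, 60)

if __name__ == "__main__":
    print(full_lines, remainder)
```

Passing the sequence text to `wrap_sequence` and summing the line lengths should reproduce the reported length of 1712 if no residues were lost in copying.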

© 2021, Biomedical Informatics Centre, NIRRH
ICMR-National Institute for Research in Reproductive Health, Jehangir Merwanji Street, Parel, Mumbai-400012
Tel: +91-22-24192104, Fax No: +91-22-24139412