COL4A3 (collagen type IV alpha 3 chain)

Gene
Entrez ID (the gene ID in the NCBI Gene database)
1285
Gene name (the full gene name approved by the HGNC)
Collagen type IV alpha 3 chain
Gene symbol (the official gene symbol approved by the HGNC, a short abbreviated form of the gene name)
COL4A3
Synonyms (gene synonyms and aliases)
ATS2, ATS3
Chromosome (chromosome number)
2
Chromosome location (the cytogenetic location of the gene or region on the chromosome)
2q36.3
Summary (summary of the gene provided by NCBI Entrez Gene)
Type IV collagen, the major structural component of basement membranes, is a multimeric protein composed of 3 alpha subunits. These subunits are encoded by 6 different genes, alpha 1 through alpha 6, each of which can form a triple helix structure with 2 other subunits to form type IV collagen. This gene encodes alpha 3. In the Goodpasture syndrome, autoantibodies bind to the collagen molecules in the basement membranes of alveoli and glomeruli. The epitopes that elicit these autoantibodies are localized largely to the non-collagenous C-terminal domain of the protein. A specific kinase phosphorylates amino acids in this same C-terminal region and the expression of this kinase is upregulated during pathogenesis. This gene is also linked to an autosomal recessive form of Alport syndrome. The mutations contributing to this syndrome are also located within the exons that encode this C-terminal region. Like the other members of the type IV collagen gene family, this gene is organized in a head-to-head conformation with another type IV collagen gene so that each gene pair shares a common promoter. [provided by RefSeq, Jun 2010]
miRNA (miRNA information provided by the miRTarBase database)
miRTarBase ID miRNA Experiments Reference
MIRT047553 hsa-miR-10a-5p CLASH 23622248
MIRT527437 hsa-miR-302d-5p PAR-CLIP 22012620
MIRT527438 hsa-miR-302b-5p PAR-CLIP 22012620
MIRT527439 hsa-miR-548t-3p PAR-CLIP 22012620
MIRT527440 hsa-miR-548ap-3p PAR-CLIP 22012620
Transcription factors
Transcription factor Regulation Reference
ZEB1 Unknown 22199242
Gene ontology (GO; ontology terms associated with the gene, provided by the GO database)
GO ID Ontology Definition Evidence Reference
GO:0005178 Function Integrin binding IDA 12682293
GO:0005178 Function Integrin binding TAS 10766752
GO:0005198 Function Structural molecule activity NAS 3025878
GO:0005201 Function Extracellular matrix structural constituent IBA 21873635
GO:0005515 Function Protein binding IPI 10212244, 12682293
Other IDs (unique IDs of the gene in databases such as OMIM, HGNC, and Ensembl)
MIM
HGNC
Ensembl
Protein
UniProt ID Q01955
Protein name Collagen alpha-3(IV) chain (Goodpasture antigen) [Cleaved into: Tumstatin]
Protein function Type IV collagen is the major structural component of glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork together with laminins, proteoglycans and entactin/nidogen. Tumstatin, a cleavage fragment corresponding to the collagen alpha 3(IV) NC1 domain, possesses both anti-angiogenic and anti-tumor cell activity; these two anti-tumor properties may be regulated via RGD-independent ITGB3-mediated mechanisms.
PDB 5NB0
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF01391 Collagen 41-104 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 97-162 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 169-223 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 282-344 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 351-412 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 386-444 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 413-478 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 482-546 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 588-648 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 699-749 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 747-809 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 788-849 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 847-905 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 892-948 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 950-1009 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 997-1060 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1061-1122 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1119-1178 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1176-1235 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1292-1352 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1379-1441 Collagen triple helix repeat (20 copies) Repeat
PF01413 C4 1446-1553 C-terminal tandem repeated domain in type 4 procollagen Domain
PF01413 C4 1556-1667 C-terminal tandem repeated domain in type 4 procollagen Domain
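
The Pfam positions above can be handled programmatically, for example to compute the length of each hit or to spot hits that overlap. The following is a minimal Python sketch, not part of the GediPNet record. It assumes the positions are 1-based and inclusive, copies only a few of the hits listed above for brevity, and uses hypothetical names (`PfamHit`, `overlapping_pairs`) chosen for illustration.

```python
from typing import List, NamedTuple, Tuple


class PfamHit(NamedTuple):
    """One Pfam hit; coordinates are assumed to be 1-based and inclusive."""
    accession: str
    name: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates, so both end points count.
        return self.end - self.start + 1


# A subset of the hits listed in the Pfam table (values copied as shown there).
HITS = [
    PfamHit("PF01391", "Collagen", 351, 412),
    PfamHit("PF01391", "Collagen", 386, 444),
    PfamHit("PF01391", "Collagen", 413, 478),
    PfamHit("PF01413", "C4", 1446, 1553),
    PfamHit("PF01413", "C4", 1556, 1667),
]


def overlapping_pairs(hits: List[PfamHit]) -> List[Tuple[PfamHit, PfamHit]]:
    """Return neighbouring hits (sorted by start) whose ranges overlap."""
    ordered = sorted(hits, key=lambda h: h.start)
    return [(a, b) for a, b in zip(ordered, ordered[1:]) if b.start <= a.end]


if __name__ == "__main__":
    for hit in HITS:
        print(f"{hit.accession} {hit.name} {hit.start}-{hit.end} length={hit.length}")
    for a, b in overlapping_pairs(HITS):
        print(f"overlap: {a.start}-{a.end} and {b.start}-{b.end}")
```

Overlapping collagen repeat hits are expected here, since several of the listed PF01391 ranges share residues with their neighbours.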
Sequence
MSARTAPRPQVLLLPLLLVLLAAAPAASKGCVCKDKGQCFCDGAKGEKGEKGFPGPPGSP
GQKGFTGPEGLPGPQGPKGFPGLPGLTGSKGVRGIS
GLPGFSGSPGLPGTPGNTGPYGLV
GVPGCSGSKGEQGFPGLPGTLGYPGIPGAAGLKGQKGAPAKE
EDIELDAKGDPGLPGAPG
PQGLPGPPGFPGPVGPPGPPGFFGFPGAMGPRGPKGHMGERVI
GHKGERGVKGLTGPPGP
PGTVIVTLTGPDNRTDLKGEKGDKGAMGEPGPPGPSGLPGESYGSEKGAPGDPGLQGKPG
KDGVPGFPGSEGVKGNRGFPGLMGEDGIKGQKGDIGPPGFRGPT
EYYDTYQEKGDEGTPG
PPGPRGARGPQGPSGPPGVPGSPGS
SRPGLRGAPGWPGLKGSKGERGRPGKDAMGTPGSP
GCAGSPGLPGSPGPPGPPGDIVFR
KGPPGDHGLPGYLGSPGIPGVDGPKGEPGLLCTQ
CP
YIPGPPGLPGLPGLHGVKGIPGRQGAAGLKGSPGSPGNTGLPGFPGFPGAQGDPGLKGEK
GETLQP
EGQVGVPGDPGLRGQPGRKGLDGIPGTPGVKGLPGPKGELALSGEKGDQGPPGD
PGSPGSPGPAGPAGPPGYGPQGEPGLQGTQGVPGAPGPPGEAGPRGEL
SVSTPVPGPPGP
PGPPGHPGPQGPPGIPGSLGKCGDPGLPGPDGEPGIPGIGFPGPPGPKGDQGFPGTKGSL
GCPGKMGEPGLPGKPGLPGAKGEPAV
AMPGGPGTPGFPGERGNSGEHGEIGLPGLPGLPG
TPGNEGLDGPRGDPGQPGPPGEQGPPGRCIEGPRGAQGLPGLNGLKGQQGRRGKTGPKGD
PGIPGLDRSGFPGETGSPGIPGHQGEMGPLGQRGYPGNPGILGPPGEDGVIGMMGFPGAI
GPPGP
PGNPGTPGQRGSPGIPGVKGQRGTPGAKGEQGDKGNPGPSEIS
HVIGDKGEPGLK
GFAGNPGEKGNRGVPGMPGLKGLKGLPGPAGPPGPR
GDLGSTGNPGEPGLRGIPGSMGNM
GMPGSKGKRGTLGFPGRAGRPGLPGIHGLQGDKGEPGYSE
GTRPGPPGPTGDPGLPGDMG
KKGEMGQPGPPGHLGPAGPEGAPGSPGSPGLPGKPGPH
GDLGFKGIKGLLGPPGIRGPPG
LPGFPGSPGPMGIRGDQGRDGIPGPAGEKGETGLLRAPPGPRGNPGAQGAKGDRGAPGFP
GLPGRKGAMGDAGPRGPTGIEGFPGPPGLPGAIIP
GQTGNRGPPGSRGSPGAPGPPGPPG
SHVIGIKGDKGSMGHPGPKGPPGTAGDMGPPGRLGAPGTPGLPGPRGDPGFQGFPGVKGE
KGNPGFLGSIGPPGPIGPKGPPGVRGDPGTLK
IISLPGSPGPPGTPGEPGMQGEPGPPGP
PGNLGPCGPRGKPGKDGKPGTPGPAGEKGNKGSKGEPGPAGSDGLPGLKGKRGDSGSPAT
W
TTRGFVFTRHSQTTAIPSCPEGTVPLYSGFSFLFVQGNQRAHGQDLGTLGSCLQRFTTM
PFLFCNVNDVCNFASRNDYSYWLSTPALMPMNMAPITGRALEPYISRCTVCEG
PAIAIAV
HSQTTDIPPCPHGWISLWKGFSFIMFTSAGSEGTGQALASPGSCLEEFRASPFLECHGRG
TCNYYSNSYSFWLASLNPERMFRKPIPSTVKAGELEKIISRCQVCMK
KRH
Sequence length 1670
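
Because the sequence above is displayed in fragments of uneven length, it may be useful to join the fragments and compare the residue count with the reported sequence length of 1670. The sketch below is an illustration only: `join_fragments` is a hypothetical helper, the alphabet check assumes the 20 standard amino acid letters, and only the first two fragments are included, so the full list would need to be pasted in to check against 1670.

```python
from typing import Iterable

# Assumption: only the 20 standard amino acid one-letter codes are expected.
STANDARD_RESIDUES = set("ACDEFGHIKLMNPQRSTVWY")


def join_fragments(fragments: Iterable[str]) -> str:
    """Concatenate sequence fragments, dropping whitespace, and validate letters."""
    sequence = "".join("".join(part.split()) for part in fragments).upper()
    unexpected = sorted(set(sequence) - STANDARD_RESIDUES)
    if unexpected:
        raise ValueError(f"unexpected characters in sequence: {unexpected}")
    return sequence


# The first two fragments of the sequence as displayed above.
FRAGMENTS = [
    "MSARTAPRPQVLLLPLLLVLLAAAPAASKGCVCKDKGQCFCDGAKGEKGEKGFPGPPGSP",
    "GQKGFTGPEGLPGPQGPKGFPGLPGLTGSKGVRGIS",
]

if __name__ == "__main__":
    sequence = join_fragments(FRAGMENTS)
    print(len(sequence))
```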

| © 2021, Biomedical Informatics Centre, NIRRH |
ICMR-National Institute for Research in Reproductive Health, Jehangir Merwanji Street, Parel, Mumbai-400012
Tel: +91-22-24192104, Fax No: +91-22-24139412