GediPNet logo

COL4A4 (collagen type IV alpha 4 chain)

Gene
Entrez ID Entrez Gene ID - the GENE ID in NCBI Gene database.
1286
Gene nameGene Name - the full gene name approved by the HGNC.
Collagen type IV alpha 4 chain
Gene symbolGene Symbol - the official gene symbol approved by the HGNC, which is a short abbreviated form of the gene name.
COL4A4
SynonymsGene synonyms aliases
ATS2, BFH, CA44
ChromosomeChromosome number
2
Chromosome locationChromosomal Location - indicates the cytogenetic location of the gene or region on the chromosome.
2q36.3
SummarySummary of gene provided in NCBI Entrez Gene.
This gene encodes one of the six subunits of type IV collagen, the major structural component of basement membranes. This particular collagen IV subunit, however, is only found in a subset of basement membranes. Like the other members of the type IV collagen gene family, this gene is organized in a head-to-head conformation with another type IV collagen gene so that each gene pair shares a common promoter. Mutations in this gene are associated with type II autosomal recessive Alport syndrome (hereditary glomerulonephropathy) and with familial benign hematuria (thin basement membrane disease). Two transcripts, differing only in their transcription start sites, have been identified for this gene and, as is common for collagen genes, multiple polyadenylation sites are found in the 3` UTR. [provided by RefSeq, Jul 2008]
SNPsSNP information provided by dbSNP.
SNP ID Visualize variation Clinical significance Consequence
rs35138315 G>C Pathogenic Coding sequence variant, stop gained, non coding transcript variant
rs55978207 C>A,T Likely-benign, conflicting-interpretations-of-pathogenicity Synonymous variant, coding sequence variant, non coding transcript variant
rs114969026 T>C Conflicting-interpretations-of-pathogenicity, benign Non coding transcript variant, genic upstream transcript variant, coding sequence variant, missense variant
rs121912858 C>T Pathogenic Non coding transcript variant, coding sequence variant, missense variant
rs121912859 G>T Pathogenic Non coding transcript variant, stop gained, coding sequence variant
miRNAmiRNA information provided by mirtarbase database.
miRTarBase ID miRNA Experiments Reference
MIRT022562 hsa-miR-124-3p Microarray 18668037
MIRT029341 hsa-miR-26b-5p Microarray 19088304
MIRT437823 hsa-miR-150-5p Luciferase reporter assay 23604143
MIRT618296 hsa-miR-4659b-3p HITS-CLIP 23824327
MIRT618297 hsa-miR-4659a-3p HITS-CLIP 23824327
Transcription factors
Transcription factor Regulation Reference
LMX1B Activation 11175791
Gene ontology (GO)Gene ontology information of associated ontologies with gene provided by GO database.
GO ID Ontology Definition Evidence Reference
GO:0005201 Function Extracellular matrix structural constituent IBA 21873635
GO:0005201 Function Extracellular matrix structural constituent IMP 17942953, 19675380
GO:0005576 Component Extracellular region TAS
GO:0005587 Component Collagen type IV trimer IBA 21873635
GO:0005587 Component Collagen type IV trimer IDA 7523402
Other IDsOther ids provides unique ids of gene in databases such as OMIM, HGNC, ENSEMBLE.
MIM
HGNC
e!Ensembl
Protein
UniProt ID P53420
Protein name Collagen alpha-4(IV) chain
Protein function Type IV collagen is the major structural component of glomerular basement membranes (GBM), forming a 'chicken-wire' meshwork together with laminins, proteoglycans and entactin/nidogen.
PDB 5NB1
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF01391 Collagen
62 121
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
118 176
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
181 238
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
296 360
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
365 428
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
400 457
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
494 559
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
906 967
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
969 1029
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
1015 1080
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
1082 1141
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
1133 1195
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
1196 1253
Collagen triple helix repeat (20 copies)
Repeat
PF01391 Collagen
1313 1372
Collagen triple helix repeat (20 copies)
Repeat
PF01413 C4
1466 1571
C-terminal tandem repeated domain in type 4 procollagen
Domain
PF01413 C4
1574 1688
C-terminal tandem repeated domain in type 4 procollagen
Domain
Sequence
MWSLHIVLMRCSFRLTKSLATGPWSLILILFSVQYVYGSGKKYIGPCGGRDCSVCHCVPE
KGSRGPPGPPGPQGPIGPLGAPGPIGLSGEKGMRGDRGPPGAAGDKGDKGPTGVPGFPGL
D
GIPGHPGPPGPRGKPGMSGHNGSRGDPGFPGGRGALGPGGPLGHPGEKGEKGNSV
FILG
AVKGIQGDRGDPGLPGLPGSWGAGGPAGPTGYPGEPGLVGPPGQPGRPGLKGNPGVGVKG
QMGDPGEVGQQGSPGPTLLVEPPDFCLYKGEKGIKGIPGMVGLPGPPGRKGESGIGAKGE
KGIPGFPGPRGDPGSYGSPGFPGLKGELGLVGDPGLFGLIGPKGDPGNRGHPGPPGVLVT

PPLPLKGPPGDPGFPGRYGETGDVGPPGPPGLLGRPGEACAGMIGPPGPQGFPGLPGLPG
EAGIPGRP
DSAPGKPGKPGSPGLPGAPGLQGLPGSSV
IYCSVGNPGPQGIKGKVGPPGGR
GPKGEKGNEGLCACEPGPMGPPGPPGLPGRQGSKGDLGLPGWLGTKGDPGPPGAEGPPGL
PGKHGASGPPGNKGAKGDM
VVSRVKGHKGERGPDGPPGFPGQPGSHGRDGHAGEKGDPGP
PGDHEDATPGGKGFPGPLGPPGKAGPVGPPGLGFPGPPGERGHPGVPGHPGVRGPDGLKG
QKGDTISCNVTYPGRHGPPGFDGPPGPKGFPGPQGAPGLSGSDGHKGRPGTPGTAEIPGP
PGFRGDMGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPGDPAFGHLGPPGKRGLSGVPG
IKGPRGDPGCPGAEGPAGIPGFLGLKGPKGREGHAGFPGVPGPPGHSCERGAPGIPGQPG
LPGYPGSPGAPGGKGQPGDVGPPGPAGMKGLPGLPGRPGAHGPPGLPGIPGPFGDDGLPG
PPGPKGPRGLPGFPGFPGERGKPGAEGCPGAKGEPGEKGMSGLPGDRGLRGAKGAIGPPG
DEGEMAI
ISQKGTPGEPGPPGDDGFPGERGDKGTPGMQGRRGEPGRYGPPGFHRGEPGEK
GQPGPPGPP
GPPGSTGLRGFIGFPGLPGDQGEPGSPGPPGFSGIDGARGPKGNKGDPASH

FGPPGPKGEPGSPGCPGHFGASGEQGLPGIQGPRGSPGRPGPPGSSGPPGCPGDHGMPGL
R
GQPGEMGDPGPRGLQGDPGIPGPPGIKGPSGSPGLNGLHGLKGQKGTKGASGLH
DVGPP
GPVGIPGLKGERGDPGSPGISPPGPRGKKGPPGPPGSSGPPGPAGATGRAPKD
IPDPGPP
GDQGPPGPDGPRGAPGPPGLPGSVDLLRGEPGDCGLPGPPGPPGPPGPPGYKGFPGCDGK
DGQKGPVGFPGPQGPHGFPGPPGEKGLPGPPGRKGPTGLPGPRGEPGPPADV
DDCPRIPG
LPGAPGMRGPEGAMGLPGMRGPSGPGCKGEPGLDGRRGVDGVPGSPGPPGRKGDTGEDGY
PGGPGPPGPIGDPGPKGFGPGYLGGFLLVLHSQTDQEPTCPLGMPRLWTGYSLLYLEGQE
KAHNQDLGLAGSCLPVFSTLPFAYCNIHQVCHYAQRNDRSYWLASAAPLPMMPLSEEAIR
PYVSRCAVCEA
PAQAVAVHSQDQSIPPCPQTWRSLWIGYSFLMHTGAGDQGGGQALMSPG
SCLEDFRAAPFLECQGRQGTCHFFANKYSFWLTTVKADLQFSSAPAPDTLKESQAQRQKI
SRCQVCVK
YS
Sequence length 1690
Interactions View interactions

| © 2021, Biomedical Informatics Centre, NIRRH |
ICMR-National Institute for Research in Reproductive Health, Jehangir Merwanji Street, Parel, Mumbai-400012
Tel: +91-22-24192104, Fax No: +91-22-24139412