Gene
Entrez ID Entrez Gene ID - the GENE ID in NCBI Gene database.
1294
Gene name Gene Name - the full gene name approved by the HGNC.
Collagen type VII alpha 1 chain
Gene symbol Gene Symbol - the official gene symbol approved by the HGNC.
COL7A1
Synonyms (NCBI Gene) Gene synonyms aliases
EBD1, EBDCT, EBR1, NDNC8
Chromosome Chromosome number
3
Chromosome location Chromosomal Location - indicates the cytogenetic location of the gene or region on the chromosome.
3p21.31
Summary Summary of gene provided in NCBI Entrez Gene.
This gene encodes the alpha chain of type VII collagen. The type VII collagen fibril, composed of three identical alpha collagen chains, is restricted to the basement zone beneath stratified squamous epithelia. It functions as an anchoring fibril between
SNPs SNP information provided by dbSNP.
SNP ID Visualize variation Clinical significance Consequence
rs79378857 G>A Conflicting-interpretations-of-pathogenicity Coding sequence variant, missense variant, non coding transcript variant
rs116005007 C>A Conflicting-interpretations-of-pathogenicity, uncertain-significance Non coding transcript variant, missense variant, coding sequence variant
rs121912828 A>T Pathogenic Non coding transcript variant, missense variant, coding sequence variant, genic downstream transcript variant
rs121912829 C>T Pathogenic Non coding transcript variant, missense variant, coding sequence variant
rs121912830 G>A,T Pathogenic, likely-benign Non coding transcript variant, stop gained, synonymous variant, coding sequence variant
miRNA miRNA information provided by mirtarbase database.
miRTarBase ID miRNA Experiments Reference
MIRT006382 hsa-miR-29c-3p Luciferase reporter assay 21436257
MIRT006382 hsa-miR-29c-3p Luciferase reporter assay 21436257
MIRT006382 hsa-miR-29c-3p Luciferase reporter assay 21436257
MIRT053739 hsa-miR-29b-3p Microarray 22942087
MIRT903179 hsa-miR-1207-5p CLIP-seq
Transcription factors
Transcription factor Regulation Reference
NFKB1 Unknown 10086338
RELA Unknown 10086338
SMAD3 Unknown 9843964
SMAD4 Unknown 9843964
SP1 Unknown 10980546;9092567
Gene ontology (GO) Gene ontology information of associated ontologies with gene provided by GO database.
GO ID Ontology Definition Evidence Reference
GO:0004867 Function Serine-type endopeptidase inhibitor activity IEA
GO:0005515 Function Protein binding IPI 19269366
GO:0005576 Component Extracellular region IEA
GO:0005576 Component Extracellular region TAS
GO:0005581 Component Collagen trimer IEA
Other IDs Other ids provides unique ids of gene in databases such as OMIM, HGNC, ENSEMBLE.
MIM HGNC e!Ensembl
120120 2214 ENSG00000114270
Protein
UniProt ID Q02388
Protein name Collagen alpha-1(VII) chain (Long-chain collagen) (LC collagen)
Protein function Stratified squamous epithelial basement membrane protein that forms anchoring fibrils which may contribute to epithelial basement membrane organization and adherence by interacting with extracellular matrix (ECM) proteins such as type IV collage
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF00092 VWA 38 209 von Willebrand factor type A domain Domain
PF00041 fn3 233 318 Fibronectin type III domain Domain
PF00041 fn3 331 407 Fibronectin type III domain Domain
PF00041 fn3 419 492 Fibronectin type III domain Domain
PF00041 fn3 509 587 Fibronectin type III domain Domain
PF00041 fn3 599 677 Fibronectin type III domain Domain
PF00041 fn3 687 765 Fibronectin type III domain Domain
PF00041 fn3 777 855 Fibronectin type III domain Domain
PF00041 fn3 868 946 Fibronectin type III domain Domain
PF00092 VWA 1054 1227 von Willebrand factor type A domain Domain
PF01391 Collagen 1251 1310 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1296 1361 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1449 1504 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 1489 1548 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 2034 2099 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 2099 2158 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 2257 2321 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 2317 2374 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 2373 2430 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 2407 2465 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 2466 2524 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 2526 2584 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 2563 2639 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 2617 2694 Collagen triple helix repeat (20 copies) Repeat
PF01391 Collagen 2722 2787 Collagen triple helix repeat (20 copies) Repeat
PF00014 Kunitz_BPTI 2875 2930 Kunitz/Bovine pancreatic trypsin inhibitor domain Domain
Sequence
MTLRLLVAALCAGILAEAPRVRAQHRERVTCTRLYAADIVFLLDGSSSIGRSNFREVRSF
LEGLVLPFSGAASAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGNTRTG
AAILHVADHVFLPQLARPGVPKVCILITDGKSQDLVDTAAQRLKGQGVKLFAVGIKNADP
EELKRVASQPTSDFFFFVNDFSILRTLLP
LVSRRVCTTAGGVPVTRPPDDSTSAPRDLVL
SEPSSQSLRVQWTAASGPVTGYKVQYTPLTGLGQPLPSERQEVNVPAGETSVRLRGLRPL
TEYQVTVIALYANSIGEA
VSGTARTTALEGPELTIQNTTAHSLLVAWRSVPGATGYRVTW
RVLSGGPTQQQELGPGQGSVLLRDLEPGTDYEVTVSTLFGRSVGPAT
SLMARTDASVEQT
LRPVILGPTSILLSWNLVPEARGYRLEWRRETGLEPPQKVVLPSDVTRYQLDGLQPGTEY
RLTLYTLLEGHE
VATPATVVPTGPELPVSPVTDLQATELPGQRVRVSWSPVPGATQYRII
VRSTQGVERTLVLPGSQTAFDLDDVQAGLSYTVRVSARVGPREGSAS
VLTVRREPETPLA
VPGLRVVVSDATRVRVAWGPVPGASGFRISWSTGSGPESSQTLPPDSTATDITGLQPGTT
YQVAVSVLRGREEGPAA
VIVARTDPLGPVRTVHVTQASSSSVTITWTRVPGATGYRVSWH
SAHGPEKSQLVSGEATVAELDGLEPDTEYTVHVRAHVAGVDGPPA
SVVVRTAPEPVGRVS
RLQILNASSDVLRITWVGVTGATAYRLAWGRSEGGPMRHQILPGNTDSAEIRGLEGGVSY
SVRVTALVGDREGTP
VSIVVTTPPEAPPALGTLHVVQRGEHSLRLRWEPVPRAQGFLLHW
QPEGGQEQSRVLGPELSSYHLDGLEPATQYRVRLSVLGPAGEGPSA
EVTARTESPRVPSI
ELRVVDTSIDSVTLAWTPVSRASSYILSWRPLRGPGQEVPGSPQTLPGISSSQRVTGLEP
GVSYIFSLTPVLDGVRGPEASVTQTPVCPRGLADVVFLPHATQDNAHRAEATRRVLERLV
LALGPLGPQAVQVGLLSYSHRPSPLFPLNGSHDLGIILQRIRDMPYMDPSGNNLGTAVVT
AHRYMLAPDAPGRRQHVPGVMVLLVDEPLRGDIFSPIREAQASGLNVVMLGMAGADPEQL
RRLAPGMDSVQTFFAVDDGPSLDQAVS
GLATALCQASFTTQPRPEPCPVYCPKGQKGEPG
EMGLRGQVGPPGDPGLPGRTGAPGPQGPPGSATAK
GERGFPGADGRPGSPGRAGNPGTPG
APGLKGSPGLPGPRGDPGERGPRGPKGEPGAPGQVIGGEGP
GLPGRKGDPGPSGPPGPRG
PLGDPGPRGPPGLPGTAMKGDKGDRGERGPPGPGEGGIAPGEPGLPGLPGSPGPQGPVGP
PGKKGEKGDSEDGAPGLPGQPGSPGEQGPRGPPGAIGPKGDRGFPGPLGEAGEKGERGPP
GPAG
SRGLPGVAGRPGAKGPEGPPGPTGRQGEKGEPGRPGDPAVVGPA
VAGPKGEKGDVG
PAGPRGATGVQGERGPPGLVLPGDPGPKGDPGDRGPIGLTGRAGPPGDSGPPGEKGDPGR
PGPPGPVGPRGRDGEVGEKGDEGPPGDPGLPGKAGERGLRGAPGVRGPVGEKGDQGDPGE
DGRNGSPGSSGPKGDRGEPGPPGPPGRLVDTGPGAREKGEPGDRGQEGPRGPKGDPGLPG
APGERGIEGFRGPPGPQGDPGVRGPAGEKGDRGPPGLDGRSGLDGKPGAAGPSGPNGAAG
KAGDPGRDGLPGLRGEQGLPGPSGPPGLPGKPGEDGKPGLNGKNGEPGDPGEDGRKGEKG
DSGASGREGRDGPKGERGAPGILGPQGPPGLPGPVGPPGQGFPGVPGGTGPKGDRGETGS
KGEQGLPGERGLRGEPGSVPNVDRLLETAGIKASALREIVETWDESSGSFLPVPERRRGP
KGDSGEQGPPGKEGPIGFPGERGLKGDRGDPGPQGPPGLALGERGPPGPSGLAGEPGKPG
IPGLPGRAGGVGEAGRPGERGERGEKGERGEQGRDGPPGLPGTPGPPGPPGPKVSVDE
PG
PGLSGEQGPPGLKGAKGEPGSNGDQGPKGDRGVPGIKGDRGEPGPRGQDGNPGLPGER
GM
AGPEGKPGLQGPRGPPGPVGGHGDPGPPGAPGLAGPAGPQGPSGLKGEPGETGPPGRGLT
GPTGAVGLPGPPGPSGLVGPQGSPGLPGQVGETGKPGAPGRDGASGKDGDRGSPGVPGSP
GLPGPVGPKGEPGPTGAPGQAVVGLPGAKGEKGAPG
GLAGDLVGEPGAKGDRGLPGPRGE
KGEAGRAGEPGDPGEDGQKGAPGPKGFKGDPGVGVPGSPGPPGPPGVKGDLGLPGLPGAP
GVVGFPGQTGPRGEMGQPGPSGERGLAGPPGREGIPGPLGPPGPPGSVGPPGASGLKGDK
GDPGV
GLPGPRGERGEPGIRGEDGRPGQEGPRGLTGPPGSRGERGEKGDVGSAGLKGDKG
DSAV
ILGPPGPRGAKGDMGERGPRGLDGDKGPRGDNGDPGDKGSKGEPGDKGSAGLPGLR
GLLG
PQGQPGAAGIPGDPGSPGKDGVPGIRGEKGDVGFMGPRGLKGERGVKGACGLDGEK
GDKGEAGPPGRPGLAGHKGEMGEPGVPGQSGAPGKEGLIGPKGDRGFDGQPGPK
GDQGEK
GERGTPGIGGFPGPSGNDGSAGPPGPPGSVGPRGPEGLQGQKGERGPPGERVVGAPGVPG
APGERGEQGRPGPAGPRGEKGEAALTE
DDIRGFVRQEMSQHCACQGQFIASGSRPLPSYA
ADTAGSQLHAVPVLRVSHAEEEERVPPEDDEYSEYSEYSVEEYQDPEAPWDSDDPCSLPL
DEGSCTAYTLRWYHRAVTGSTEACHPFVYGGCGGNANRFGTREACERRCP
PRVVQSQGTG
TAQD
Sequence length 2944
Interactions View interactions
Pathways Pathway information has different metabolic/signaling pathways associated with genes.
  KEGG   Reactome
  Protein digestion and absorption   Collagen degradation
Extracellular matrix organization
Collagen biosynthesis and modifying enzymes
Assembly of collagen fibrils and other multimeric structures
COPII-mediated vesicle transport
Integrin cell surface interactions
Anchoring fibril formation
Laminin interactions
Cargo concentration in the ER
Collagen chain trimerization
<
Associated diseases Disease associations categorized as Causal (pathogenic variants), Unknown (uncertain genetic evidence), or Text Mining (literature-based associations)
Causal Diseases caused by PATHOGENIC or LIKELY PATHOGENIC variants from ClinVar only
Disease merge term Disease name dbSNP ID References
Dystrophic Epidermolysis Bullosa Recessive dystrophic epidermolysis bullosa, Pretibial dystrophic epidermolysis bullosa, transient bullous dermolysis of the newborn rs780261665, rs121912830, rs121912836, rs1064797079, rs757415879, rs1559423385, rs1575494051, rs1368134215, rs886041186, rs761234904, rs1575442301, rs1064797078, rs1553862581, rs760063197, rs121912851
View all (52 more)
N/A
Epidermolysis Bullosa epidermolysis bullosa dystrophica, Epidermolysis bullosa pruriginosa, autosomal dominant, Epidermolysis bullosa, pretibial, autosomal recessive rs121912849, rs121912854, rs760063197, rs786204774, rs121912836, rs886058642, rs1032335328, rs780623622, rs121912844, rs374718902, rs762162799, rs1553612928, rs121912837, rs886039330, rs121912845
View all (40 more)
N/A
Epidermolysis Bullosa Dystrophica Inversa epidermolysis bullosa dystrophica inversa, autosomal recessive rs761234904, rs768128088, rs143457874, rs1131691385, rs775288140, rs387906604, rs760891216, rs757688782, rs772756089, rs767182886, rs200972872, rs121912856, rs121912847, rs773263825, rs201728948
View all (4 more)
N/A
Epidermolysis Bullosa Pruriginosa epidermolysis bullosa pruriginosa rs121912855, rs1064793916, rs144023803, rs1057518863, rs780261665 N/A
Unknown Includes: (1) ClinVar NON-pathogenic variants (Uncertain, Benign, Conflicting, VUS), (2) GenCC associations, (3) GWAS associations, (4) CBGDA evidence-based associations. NOTE: Diseases with pathogenic evidence are excluded to avoid conflicts.
Disease merge term Disease name Evidence References Source
Amelogenesis imperfecta Amelogenesis imperfecta type 1 N/A N/A ClinVar
Duane Retraction Syndrome duane retraction syndrome N/A N/A ClinVar
Hepatoblastoma hepatoblastoma N/A N/A ClinVar
Associations from Text Mining Disease associations identified through Pubtator
Disease Name Relationship Type References
Adenocarcinoma of Lung Associate 32031203
Amyloidosis Associate 25566895
amyloidosis IX Associate 25566895
Anemia Aplastic Associate 9284109
Aortic Valve Insufficiency Associate 12558638
Arnold Chiari Malformation Associate 33974636
Blister Associate 19726672, 20720561, 25569093, 27434145, 31578311, 35121199, 35163654, 36901775, 8288900, 8592061, 9242516, 9668111
Carcinoma Adenoid Cystic Associate 1850773
Carcinoma Renal Cell Associate 32789468
Carcinoma Squamous Cell Associate 17853916, 35993054, 36901775, 37077084