Gene Gene information from NCBI Gene database.
Entrez ID 3339
Gene name Heparan sulfate proteoglycan 2
Gene symbol HSPG2
Synonyms (NCBI Gene)
HSPGPLCPRCANSJASJSSJS1
Chromosome 1
Chromosome location 1p36.12
Summary This gene encodes the perlecan protein, which consists of a core protein to which three long chains of glycosaminoglycans (heparan sulfate or chondroitin sulfate) are attached. The perlecan protein is a large multidomain proteoglycan that binds to and cro
SNPs SNP information provided by dbSNP.
42
SNP ID Visualize variation Clinical significance Consequence
rs2229490 C>A,T Conflicting-interpretations-of-pathogenicity Coding sequence variant, missense variant
rs55875654 G>A Uncertain-significance, conflicting-interpretations-of-pathogenicity, likely-benign Coding sequence variant, synonymous variant
rs62642527 C>T Conflicting-interpretations-of-pathogenicity Missense variant, coding sequence variant
rs62642535 T>A,G Conflicting-interpretations-of-pathogenicity Missense variant, coding sequence variant
rs111866498 G>A Uncertain-significance, benign, conflicting-interpretations-of-pathogenicity Synonymous variant, coding sequence variant
miRNA miRNA information provided by mirtarbase database.
156
miRTarBase ID miRNA Experiments Reference
MIRT007259 hsa-miR-663a Luciferase reporter assay 23436656
MIRT007259 hsa-miR-663a Luciferase reporter assay 23436656
MIRT007259 hsa-miR-663a Luciferase reporter assay 23436656
MIRT018272 hsa-miR-335-5p Microarray 18185580
MIRT440478 hsa-miR-218-5p HITS-CLIP 23212916
Gene ontology (GO) Gene Ontology (GO) annotations describing the biological processes, molecular functions, and cellular components associated with a gene.
47
GO ID Ontology Definition Evidence Reference
GO:0001525 Process Angiogenesis IEA
GO:0001540 Function Amyloid-beta binding IC 21289173
GO:0001666 Process Response to hypoxia IEA
GO:0001938 Process Positive regulation of endothelial cell proliferation IEA
GO:0005178 Function Integrin binding TAS 21126803
Other IDs Other IDs provides unique identifiers for this gene in OMIM, HGNC, and Ensembl databases.
MIM HGNC e!Ensembl
142461 5273 ENSG00000142798
Protein Protein information from UniProt database.
UniProt ID Unique identifier for the protein in the UniProt database. Click to view detailed protein information.
P98160
Protein name Basement membrane-specific heparan sulfate proteoglycan core protein (HSPG) (Perlecan) (PLC) [Cleaved into: Endorepellin; LG3 peptide]
Protein function Integral component of basement membranes. Component of the glomerular basement membrane (GBM), responsible for the fixed negative electrostatic membrane charge, and which provides a barrier which is both size- and charge-selective. It serves as
PDB 3SH4 , 3SH5
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF00057 Ldl_recept_a 197 234 Low-density lipoprotein receptor domain class A Repeat
PF00057 Ldl_recept_a 283 319 Low-density lipoprotein receptor domain class A Repeat
PF00057 Ldl_recept_a 323 359 Low-density lipoprotein receptor domain class A Repeat
PF00057 Ldl_recept_a 366 403 Low-density lipoprotein receptor domain class A Repeat
PF13927 Ig_3 405 483 Domain
PF00052 Laminin_B 595 729 Laminin B (Domain IV) Family
PF00053 Laminin_EGF 764 811 Laminin EGF domain Domain
PF00053 Laminin_EGF 814 869 Laminin EGF domain Domain
PF00053 Laminin_EGF 867 921 Laminin EGF domain Domain
PF00052 Laminin_B 990 1124 Laminin B (Domain IV) Family
PF00053 Laminin_EGF 1116 1156 Laminin EGF domain Domain
PF00053 Laminin_EGF 1159 1206 Laminin EGF domain Domain
PF00053 Laminin_EGF 1209 1264 Laminin EGF domain Domain
PF00053 Laminin_EGF 1275 1322 Laminin EGF domain Domain
PF00052 Laminin_B 1396 1528 Laminin B (Domain IV) Family
PF00053 Laminin_EGF 1523 1556 Laminin EGF domain Domain
PF00053 Laminin_EGF 1563 1610 Laminin EGF domain Domain
PF00053 Laminin_EGF 1613 1668 Laminin EGF domain Domain
PF13927 Ig_3 1676 1750 Domain
PF13927 Ig_3 1769 1844 Domain
PF07679 I-set 1866 1950 Immunoglobulin I-set domain Domain
PF07679 I-set 1956 2042 Immunoglobulin I-set domain Domain
PF13927 Ig_3 2050 2121 Domain
PF13927 Ig_3 2153 2221 Domain
PF13927 Ig_3 2245 2315 Domain
PF13927 Ig_3 2342 2410 Domain
PF07679 I-set 2437 2520 Immunoglobulin I-set domain Domain
PF13927 Ig_3 2535 2604 Domain
PF13927 Ig_3 2629 2700 Domain
PF13927 Ig_3 2728 2796 Domain
PF07679 I-set 2827 2910 Immunoglobulin I-set domain Domain
PF13895 Ig_2 2926 3009 Immunoglobulin domain Domain
PF13927 Ig_3 3021 3095 Domain
PF13927 Ig_3 3111 3189 Domain
PF07679 I-set 3212 3295 Immunoglobulin I-set domain Domain
PF07679 I-set 3299 3382 Immunoglobulin I-set domain Domain
PF07679 I-set 3400 3483 Immunoglobulin I-set domain Domain
PF07679 I-set 3489 3572 Immunoglobulin I-set domain Domain
PF07679 I-set 3576 3658 Immunoglobulin I-set domain Domain
PF00054 Laminin_G_1 3692 3831 Laminin G domain Domain
PF00008 EGF 3848 3879 EGF-like domain Domain
PF00054 Laminin_G_1 3957 4088 Laminin G domain Domain
PF00008 EGF 4108 4139 EGF-like domain Domain
PF02210 Laminin_G_2 4234 4364 Laminin G domain Domain
Tissue specificity TISSUE SPECIFICITY: Detected in cerebrospinal fluid, fibroblasts and urine (at protein level). {ECO:0000269|PubMed:25326458, ECO:0000269|PubMed:36213313}.
Sequence
MGWRAAGALLLALLLHGRLLAVTHGLRAYDGLSLPEDIETVTASQMRWTHSYLSDDEDML
ADSISGDDLGSGDLGSGDFQMVYFRALVNFTRSIEYSPQLEDAGSREFREVSEAVVDTLE
SEYLKIPGDQVVSVVFIKELDGWVFVELDVGSEGNADGAQIQEMLLRVISSGSVASYVTS
PQGFQFRRLGTVPQFPRACTEAEFACHSYNECVALEYRCDRRPDCRDMSDELNCEEPVLG
ISPTFSLLVETTSLPPRPETTIMRQPPVTHAPQPLLPGSVRPLPCGPQEAACRNGHCIPR
DYLCDGQEDCEDGSDELDC
GPPPPCEPNEFPCGNGHCALKLWRCDGDFDCEDRTDEANCP
TKRPEEVCGPTQFRCVSTNMCIPASFHCDEESDCPDRSDEFGCMPPQVVTPPRESIQASR
GQTVTFTCVAIGVPTPIINWRLNWGHIPSHPRVTVTSEGGRGTLIIRDVKESDQGAYTCE
AMN
ARGMVFGIPDGVLELVPQRGPCPDGHFYLEHSAACLPCFCFGITSVCQSTRRFRDQI
RLRFDQPDDFKGVNVTMPAQPGTPPLSSTQLQIDPSLHEFQLVDLSRRFLVHDSFWALPE
QFLGNKVDSYGGSLRYNVRYELARGMLEPVQRPDVVLMGAGYRLLSRGHTPTQPGALNQR
QVQFSEEHWVHESGRPVQRAELLQVLQSLEAVLIQTVYNTKMASVGLSDIAMDTTVTHAT
SHGRAHSVE
ECRCPIGYSGLSCESCDAHFTRVPGGPYLGTCSGCNCNGHASSCDPVYGHC
LNCQHNTEGPQCNKCKAGFFGDAMKATATSC
RPCPCPYIDASRRFSDTCFLDTDGQATCD
ACAPGYTGRRCESCAPGYEGNPIQPG
GKCRPVNQEIVRCDERGSMGTSGEACRCKNNVVG
RLCNECADGSFHLSTRNPDGC
LKCFCMGVSRHCTSSSWSRAQLHGASEEPGHFSLTNAAS
THTTNEGIFSPTPGELGFSSFHRLLSGPYFWSLPSRFLGDKVTSYGGELRFTVTQRSQPG
STPLHGQPLVVLQGNNIILEHHVAQEPSPGQPSTFIVPFREQAWQRPDGQPATREHLLMA
LAGIDTLLIRASYAQQPAESRVSGISMDVAVPEET
GQDPALEVEQCSCPPGYRGPSCQDC
DTGYTRTPSGLYLGTC
ERCSCHGHSEACEPETGACQGCQHHTEGPRCEQCQPGYYGDAQR
GTPQDC
QLCPCYGDPAAGQAAHTCFLDTDGHPTCDACSPGHSGRHCERCAPGYYGNPSQG
QPCQ
RDSQVPGPIGCNCDPQGSVSSQCDAAGQCQCKAQVEGLTCSHCRPHHFHLSASNPD
GC
LPCFCMGITQQCASSAYTRHLISTHFAPGDFQGFALVNPQRNSRLTGEFTVEPVPEGA
QLSFGNFAQLGHESFYWQLPETYQGDKVAAYGGKLRYTLSYTAGPQGSPLSDPDVQITGN
NIMLVASQPALQGPERRSYEIMFREEFWRRPDGQPATREHLLMALADLDELLIRATFSSV
PLAASISAVSLEVAQPGPSNRP
RALEVEECRCPPGYIGLSCQDCAPGYTRTGSGLYLGHC
ELCECNGHSDLCHPETGACSQCQHNAAGEFCELCAPGYYGDATAGTPEDCQPCACPLTNP
ENMFSRTCESLGAGGYRCTACEPGYTGQYCEQCGPGYVGNPSVQGGQC
LPETNQAPLVVE
VHPARSIVPQGGSHSLRCQVSGSPPHYFYWSREDGRPVPSGTQQRHQGSELHFPSVQPSD
AGVYICTCRN
LHQSNTSRAELLVTEAPSKPITVTVEEQRSQSVRPGADVTFICTAKSKSP
AYTLVWTRLHNGKLPTRAMDFNGILTIRNVQLSDAGTYVCTGSN
MFAMDQGTATLHVQAS
GTLSAPVVSIHPPQLTVQPGQLAEFRCSATGSPTPTLEWTGGPGGQLPAKAQIHGGILRL
PAVEPTDQAQYLCRAHSSAGQQVARAVLHV
HGGGGPRVQVSPERTQVHAGRTVRLYCRAA
GVPSATITWRKEGGSLPPQARSERTDIATLLIPAITTADAGFYLCVATSPAGTAQARIQV
VV
LSASDASPPPVKIESSSPSVTEGQTLDLNCVVAGSAHAQVTWYRRGGSLPPHTQVHGS
RLRLPQVSPADSGEYVCRVEN
GSGPKEASITVSVLHGTHSGPSYTPVPGSTRPIRIEPSS
SHVAEGQTLDLNCVVPGQAHAQVTWHKRGGSLPARHQTHGSLLRLHQVTPADSGEYVCHV
V
GTSGPLEASVLVTIEASVIPGPIPPVRIESSSSTVAEGQTLDLSCVVAGQAHAQVTWYK
RGGSLPARHQVRGSRLYIFQASPADAGQYVCRASN
GMEASITVTVTGTQGANLAYPAGST
QPIRIEPSSSQVAEGQTLDLNCVVPGQSHAQVTWHKRGGSLPVRHQTHGSLLRLYQASPA
DSGEYVCRVL
GSSVPLEASVLVTIEPAGSVPALGVTPTVRIESSSSQVAEGQTLDLNCLV
AGQAHAQVTWHKRGGSLPARHQVHGSRLRLLQVTPADSGEYVCRVVGSSGTQEASVLVTI

QQRLSGSHSQGVAYPVRIESSSASLANGHTLDLNCLVASQAPHTITWYKRGGSLPSRHQI
VGSRLRIPQVTPADSGEYVCHVSN
GAGSRETSLIVTIQGSGSSHVPSVSPPIRIESSSPT
VVEGQTLDLNCVVARQPQAIITWYKRGGSLPSRHQTHGSHLRLHQMSVADSGEYVCRANN

NIDALEASIVISVSPSAGSPSAPGSSMPIRIESSSSHVAEGETLDLNCVVPGQAHAQVTW
HKRGGSLPSHHQTRGSRLRLHHVSPADSGEYVCRVM
GSSGPLEASVLVTIEASGSSAVHV
PAPGGAPPIRIEPSSSRVAEGQTLDLKCVVPGQAHAQVTWHKRGGNLPARHQVHGPLLRL
NQVSPADSGEYSCQVTGSSGTLEASVLVTI
EPSSPGPIPAPGLAQPIYIEASSSHVTEGQ
TLDLNCVVPGQAHAQVTWYKRGGSLPARHQTHGSQLRLHLVSPADSGEYVCRAASGPGPE
QEASFTVTV
PPSEGSSYRLRSPVISIDPPSSTVQQGQDASFKCLIHDGAAPISLEWKTRN
QELEDNVHISPNGSIITIVGTRPSNHGTYRCVASN
AYGVAQSVVNLSVHGPPTVSVLPEG
PVWVKVGKAVTLECVSAGEPRSSARWTRISSTPAKLEQRTYGLMDSHAVLQISSAKPSDA
GTYVCLAQN
ALGTAQKQVEVIVDTGAMAPGAPQVQAEEAELTVEAGHTATLRCSATGSPA
PTIHWSKLRSPLPWQHRLEGDTLIIPRVAQQDSGQYICNATSPAGHAEATIILHV
ESPPY
ATTVPEHASVQAGETVQLQCLAHGTPPLTFQWSRVGSSLPGRATARNELLHFERAAPEDS
GRYRCRVTNKVGSAEAFAQLLV
QGPPGSLPATSIPAGSTPTVQVTPQLETKSIGASVEFH
CAVPSDRGTQLRWFKEGGQLPPGHSVQDGVLRIQNLDQSCQGTYICQAHGPWGKAQASAQ
LVI
QALPSVLINIRTSVQTVVVGHAVEFECLALGDPKPQVTWSKVGGHLRPGIVQSGGVV
RIAHVELADAGQYRCTATNAAGTTQSHVLLLV
QALPQISMPQEVRVPAGSAAVFPCIASG
YPTPDISWSKLDGSLPPDSRLENNMLMLPSVRPQDAGTYVCTATNRQGKVKAFAHLQV
PE
RVVPYFTQTPYSFLPLPTIKDAYRKFEIKITFRPDSADGMLLYNGQKRVPGSPTNLANRQ
PDFISFGLVGGRPEFRFDAGSGMATIRHPTPLALGHFHTVTLLRSLTQGSLIVGDLAPVN
GTSQGKFQGLDLNEELYLGGYPDYGAIPKAGLSSGFIGCVRELRIQGEEIV
FHDLNLTAH
GISHCPTCRDRPCQNGGQCHDSESSSYVCVCPAGFTGSRCEHSQALHCHPEACGPDATCV
NRPDGRGYTCRCHLGRSGLRCEEGVTVTTPSLSGAGSYLALPALTNTHHELRLDVEFKPL
APDGVLLFSGGKSGPVEDFVSLAMVGGHLEFRYELGSGLAVLRSAEPLALGRWHRVSAER
LNKDGSLRVNGGRPVLRSSPGKSQGLNLHTLLYLGGVEPSVPLSPATNMSAHFRGCVGEV
SVNGKRLD
LTYSFLGSQGIGQCYDSSPCERQPCQHGATCMPAGEYEFQCLCRDGFKGDLC
EHEENPCQLREPCLHGGTCQGTRCLCLPGFSGPRCQQGSGHGIAESDWHLEGSGGNDAPG
QYGAYFHDDGFLAFPGHVFSRSLPEVPETIELEVRTSTASGLLLWQGVEVGEAGQGKDFI
SLGLQDGHLVFRYQLGSGEARLVSEDPINDGEWHRVTALREGRRGSIQVDGEELVSGRSP
GPNVAVNAKGSVYIGGAPDVATLTGGRFSSGITGCVKNLVLHSA
RPGAPPPQPLDLQHRA
QAGANTRPCPS
Sequence length 4391
Interactions View interactions
Pathways Pathway information has different metabolic/signaling pathways associated with genes.
  KEGG  Reactome
  Virion - Hepatitis viruses
ECM-receptor interaction
Cytoskeleton in muscle cells
Hepatitis B
Proteoglycans in cancer
  Degradation of the extracellular matrix
A tetrasaccharide linker sequence is required for GAG synthesis
HS-GAG biosynthesis
HS-GAG degradation
Integrin cell surface interactions
Laminin interactions
Non-integrin membrane-ECM interactions
ECM proteoglycans
Defective B4GALT7 causes EDS, progeroid type
Defective B3GAT3 causes JDSSDHD
Defective EXT2 causes exostoses 2
Defective EXT1 causes exostoses 1, TRPS2 and CHDS
Defective B3GALT6 causes EDSP2 and SEMDJL1
Retinoid metabolism and transport
Amyloid fiber formation
Associated diseases Disease associations from ClinVar categorized as Causal (Pathogenic/Likely Pathogenic) or Unknown.
1662
Causal Diseases associated with Pathogenic or Likely Pathogenic variants in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession
Abnormal facial shape Likely pathogenic; Pathogenic rs898553156 RCV002226931
HSPG2-related disorder Pathogenic; Likely pathogenic rs747493127, rs1318029350 RCV004734269
RCV004734270
Lethal Kniest-like syndrome Pathogenic; Likely pathogenic rs111306515, rs769130391, rs2152747090, rs1318029350, rs2152704005, rs1196835948, rs778653296, rs762281715, rs2152721133, rs2550621846, rs1373848573, rs1419062047, rs1294413650 RCV001336825
RCV001726530
RCV001533174
RCV005006043
RCV002249077
RCV003140611
RCV002487170
RCV002283473
RCV000016055
RCV004555793
RCV000985119
RCV001196830
RCV001824170
Malignant tumor of urinary bladder Likely pathogenic; Pathogenic rs778653296 RCV005895449
Unknown Diseases with uncertain, conflicting, or no pathogenic evidence in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession
Acute myeloid leukemia Benign; Likely benign rs55728940, rs12409663, rs74060196, rs12737091, rs59720663, rs74060191, rs35423650 RCV005914620
RCV005919122
RCV005925241
RCV005891629
RCV005891643
RCV005897291
RCV005912277
Adrenocortical carcinoma, hereditary Benign rs12737091 RCV005891632
Cervical cancer Benign; Uncertain significance; Conflicting classifications of pathogenicity; Likely benign rs74060196, rs1162899719, rs200225298, rs77527456, rs74060191, rs77375768 RCV005925243
RCV005920745
RCV005895872
RCV005891625
RCV005897293
RCV005912284
Childhood-onset schizophrenia Conflicting classifications of pathogenicity rs745452577 RCV000202344
Associations from Text MiningDisease associations identified through Pubtator
Disease Name Relationship Type References
AA amyloidosis Associate 36433666
Alzheimer Disease Associate 36928034
Amyotrophic Lateral Sclerosis Associate 32792518
Astrocytoma Associate 30402850
Atrophy Associate 23049933
Balkan Nephropathy Associate 24949484
Bone Diseases Associate 33210551
Brain Neoplasms Associate 26277786
Breast Neoplasms Associate 23436656, 30278461, 33210551
Carcinoma Renal Cell Associate 25184692