Gene Gene information from NCBI Gene database.
Entrez ID 3339
Gene name Heparan sulfate proteoglycan 2
Gene symbol HSPG2
Synonyms (NCBI Gene)
HSPGPLCPRCANSJASJSSJS1
Chromosome 1
Chromosome location 1p36.12
Summary This gene encodes the perlecan protein, which consists of a core protein to which three long chains of glycosaminoglycans (heparan sulfate or chondroitin sulfate) are attached. The perlecan protein is a large multidomain proteoglycan that binds to and cro
SNPs SNP information provided by dbSNP.
42
SNP ID Visualize variation Clinical significance Consequence
rs2229490 C>A,T Conflicting-interpretations-of-pathogenicity Coding sequence variant, missense variant
rs55875654 G>A Uncertain-significance, conflicting-interpretations-of-pathogenicity, likely-benign Coding sequence variant, synonymous variant
rs62642527 C>T Conflicting-interpretations-of-pathogenicity Missense variant, coding sequence variant
rs62642535 T>A,G Conflicting-interpretations-of-pathogenicity Missense variant, coding sequence variant
rs111866498 G>A Uncertain-significance, benign, conflicting-interpretations-of-pathogenicity Synonymous variant, coding sequence variant
miRNA miRNA information provided by mirtarbase database.
156
miRTarBase ID miRNA Experiments Reference
MIRT007259 hsa-miR-663a Luciferase reporter assay 23436656
MIRT007259 hsa-miR-663a Luciferase reporter assay 23436656
MIRT007259 hsa-miR-663a Luciferase reporter assay 23436656
MIRT018272 hsa-miR-335-5p Microarray 18185580
MIRT440478 hsa-miR-218-5p HITS-CLIP 23212916
Gene ontology (GO) Gene Ontology (GO) annotations describing the biological processes, molecular functions, and cellular components associated with a gene.
47
GO ID Ontology Definition Evidence Reference
GO:0001525 Process Angiogenesis IEA
GO:0001540 Function Amyloid-beta binding IC 21289173
GO:0001666 Process Response to hypoxia IEA
GO:0001938 Process Positive regulation of endothelial cell proliferation IEA
GO:0005178 Function Integrin binding TAS 21126803
Other IDs Other IDs provides unique identifiers for this gene in OMIM, HGNC, and Ensembl databases.
MIM HGNC e!Ensembl
142461 5273 ENSG00000142798
Protein Protein information from UniProt database.
UniProt ID Unique identifier for the protein in the UniProt database. Click to view detailed protein information.
P98160
Protein name Basement membrane-specific heparan sulfate proteoglycan core protein (HSPG) (Perlecan) (PLC) [Cleaved into: Endorepellin; LG3 peptide]
Protein function Integral component of basement membranes. Component of the glomerular basement membrane (GBM), responsible for the fixed negative electrostatic membrane charge, and which provides a barrier which is both size- and charge-selective. It serves as
PDB 3SH4 , 3SH5
Family and domains

Pfam

Accession ID Position in sequence Description Type
PF00057 Ldl_recept_a 197 234 Low-density lipoprotein receptor domain class A Repeat
PF00057 Ldl_recept_a 283 319 Low-density lipoprotein receptor domain class A Repeat
PF00057 Ldl_recept_a 323 359 Low-density lipoprotein receptor domain class A Repeat
PF00057 Ldl_recept_a 366 403 Low-density lipoprotein receptor domain class A Repeat
PF13927 Ig_3 405 483 Domain
PF00052 Laminin_B 595 729 Laminin B (Domain IV) Family
PF00053 Laminin_EGF 764 811 Laminin EGF domain Domain
PF00053 Laminin_EGF 814 869 Laminin EGF domain Domain
PF00053 Laminin_EGF 867 921 Laminin EGF domain Domain
PF00052 Laminin_B 990 1124 Laminin B (Domain IV) Family
PF00053 Laminin_EGF 1116 1156 Laminin EGF domain Domain
PF00053 Laminin_EGF 1159 1206 Laminin EGF domain Domain
PF00053 Laminin_EGF 1209 1264 Laminin EGF domain Domain
PF00053 Laminin_EGF 1275 1322 Laminin EGF domain Domain
PF00052 Laminin_B 1396 1528 Laminin B (Domain IV) Family
PF00053 Laminin_EGF 1523 1556 Laminin EGF domain Domain
PF00053 Laminin_EGF 1563 1610 Laminin EGF domain Domain
PF00053 Laminin_EGF 1613 1668 Laminin EGF domain Domain
PF13927 Ig_3 1676 1750 Domain
PF13927 Ig_3 1769 1844 Domain
PF07679 I-set 1866 1950 Immunoglobulin I-set domain Domain
PF07679 I-set 1956 2042 Immunoglobulin I-set domain Domain
PF13927 Ig_3 2050 2121 Domain
PF13927 Ig_3 2153 2221 Domain
PF13927 Ig_3 2245 2315 Domain
PF13927 Ig_3 2342 2410 Domain
PF07679 I-set 2437 2520 Immunoglobulin I-set domain Domain
PF13927 Ig_3 2535 2604 Domain
PF13927 Ig_3 2629 2700 Domain
PF13927 Ig_3 2728 2796 Domain
PF07679 I-set 2827 2910 Immunoglobulin I-set domain Domain
PF13895 Ig_2 2926 3009 Immunoglobulin domain Domain
PF13927 Ig_3 3021 3095 Domain
PF13927 Ig_3 3111 3189 Domain
PF07679 I-set 3212 3295 Immunoglobulin I-set domain Domain
PF07679 I-set 3299 3382 Immunoglobulin I-set domain Domain
PF07679 I-set 3400 3483 Immunoglobulin I-set domain Domain
PF07679 I-set 3489 3572 Immunoglobulin I-set domain Domain
PF07679 I-set 3576 3658 Immunoglobulin I-set domain Domain
PF00054 Laminin_G_1 3692 3831 Laminin G domain Domain
PF00008 EGF 3848 3879 EGF-like domain Domain
PF00054 Laminin_G_1 3957 4088 Laminin G domain Domain
PF00008 EGF 4108 4139 EGF-like domain Domain
PF02210 Laminin_G_2 4234 4364 Laminin G domain Domain
Tissue specificity TISSUE SPECIFICITY: Detected in cerebrospinal fluid, fibroblasts and urine (at protein level). {ECO:0000269|PubMed:25326458, ECO:0000269|PubMed:36213313}.
Sequence
MGWRAAGALLLALLLHGRLLAVTHGLRAYDGLSLPEDIETVTASQMRWTHSYLSDDEDML
ADSISGDDLGSGDLGSGDFQMVYFRALVNFTRSIEYSPQLEDAGSREFREVSEAVVDTLE
SEYLKIPGDQVVSVVFIKELDGWVFVELDVGSEGNADGAQIQEMLLRVISSGSVASYVTS
PQGFQFRRLGTVPQFPRACTEAEFACHSYNECVALEYRCDRRPDCRDMSDELNCEEPVLG
ISPTFSLLVETTSLPPRPETTIMRQPPVTHAPQPLLPGSVRPLPCGPQEAACRNGHCIPR
DYLCDGQEDCEDGSDELDC
GPPPPCEPNEFPCGNGHCALKLWRCDGDFDCEDRTDEANCP
TKRPEEVCGPTQFRCVSTNMCIPASFHCDEESDCPDRSDEFGCMPPQVVTPPRESIQASR
GQTVTFTCVAIGVPTPIINWRLNWGHIPSHPRVTVTSEGGRGTLIIRDVKESDQGAYTCE
AMN
ARGMVFGIPDGVLELVPQRGPCPDGHFYLEHSAACLPCFCFGITSVCQSTRRFRDQI
RLRFDQPDDFKGVNVTMPAQPGTPPLSSTQLQIDPSLHEFQLVDLSRRFLVHDSFWALPE
QFLGNKVDSYGGSLRYNVRYELARGMLEPVQRPDVVLMGAGYRLLSRGHTPTQPGALNQR
QVQFSEEHWVHESGRPVQRAELLQVLQSLEAVLIQTVYNTKMASVGLSDIAMDTTVTHAT
SHGRAHSVE
ECRCPIGYSGLSCESCDAHFTRVPGGPYLGTCSGCNCNGHASSCDPVYGHC
LNCQHNTEGPQCNKCKAGFFGDAMKATATSC
RPCPCPYIDASRRFSDTCFLDTDGQATCD
ACAPGYTGRRCESCAPGYEGNPIQPG
GKCRPVNQEIVRCDERGSMGTSGEACRCKNNVVG
RLCNECADGSFHLSTRNPDGC
LKCFCMGVSRHCTSSSWSRAQLHGASEEPGHFSLTNAAS
THTTNEGIFSPTPGELGFSSFHRLLSGPYFWSLPSRFLGDKVTSYGGELRFTVTQRSQPG
STPLHGQPLVVLQGNNIILEHHVAQEPSPGQPSTFIVPFREQAWQRPDGQPATREHLLMA
LAGIDTLLIRASYAQQPAESRVSGISMDVAVPEET
GQDPALEVEQCSCPPGYRGPSCQDC
DTGYTRTPSGLYLGTC
ERCSCHGHSEACEPETGACQGCQHHTEGPRCEQCQPGYYGDAQR
GTPQDC
QLCPCYGDPAAGQAAHTCFLDTDGHPTCDACSPGHSGRHCERCAPGYYGNPSQG
QPCQ
RDSQVPGPIGCNCDPQGSVSSQCDAAGQCQCKAQVEGLTCSHCRPHHFHLSASNPD
GC
LPCFCMGITQQCASSAYTRHLISTHFAPGDFQGFALVNPQRNSRLTGEFTVEPVPEGA
QLSFGNFAQLGHESFYWQLPETYQGDKVAAYGGKLRYTLSYTAGPQGSPLSDPDVQITGN
NIMLVASQPALQGPERRSYEIMFREEFWRRPDGQPATREHLLMALADLDELLIRATFSSV
PLAASISAVSLEVAQPGPSNRP
RALEVEECRCPPGYIGLSCQDCAPGYTRTGSGLYLGHC
ELCECNGHSDLCHPETGACSQCQHNAAGEFCELCAPGYYGDATAGTPEDCQPCACPLTNP
ENMFSRTCESLGAGGYRCTACEPGYTGQYCEQCGPGYVGNPSVQGGQC
LPETNQAPLVVE
VHPARSIVPQGGSHSLRCQVSGSPPHYFYWSREDGRPVPSGTQQRHQGSELHFPSVQPSD
AGVYICTCRN
LHQSNTSRAELLVTEAPSKPITVTVEEQRSQSVRPGADVTFICTAKSKSP
AYTLVWTRLHNGKLPTRAMDFNGILTIRNVQLSDAGTYVCTGSN
MFAMDQGTATLHVQAS
GTLSAPVVSIHPPQLTVQPGQLAEFRCSATGSPTPTLEWTGGPGGQLPAKAQIHGGILRL
PAVEPTDQAQYLCRAHSSAGQQVARAVLHV
HGGGGPRVQVSPERTQVHAGRTVRLYCRAA
GVPSATITWRKEGGSLPPQARSERTDIATLLIPAITTADAGFYLCVATSPAGTAQARIQV
VV
LSASDASPPPVKIESSSPSVTEGQTLDLNCVVAGSAHAQVTWYRRGGSLPPHTQVHGS
RLRLPQVSPADSGEYVCRVEN
GSGPKEASITVSVLHGTHSGPSYTPVPGSTRPIRIEPSS
SHVAEGQTLDLNCVVPGQAHAQVTWHKRGGSLPARHQTHGSLLRLHQVTPADSGEYVCHV
V
GTSGPLEASVLVTIEASVIPGPIPPVRIESSSSTVAEGQTLDLSCVVAGQAHAQVTWYK
RGGSLPARHQVRGSRLYIFQASPADAGQYVCRASN
GMEASITVTVTGTQGANLAYPAGST
QPIRIEPSSSQVAEGQTLDLNCVVPGQSHAQVTWHKRGGSLPVRHQTHGSLLRLYQASPA
DSGEYVCRVL
GSSVPLEASVLVTIEPAGSVPALGVTPTVRIESSSSQVAEGQTLDLNCLV
AGQAHAQVTWHKRGGSLPARHQVHGSRLRLLQVTPADSGEYVCRVVGSSGTQEASVLVTI

QQRLSGSHSQGVAYPVRIESSSASLANGHTLDLNCLVASQAPHTITWYKRGGSLPSRHQI
VGSRLRIPQVTPADSGEYVCHVSN
GAGSRETSLIVTIQGSGSSHVPSVSPPIRIESSSPT
VVEGQTLDLNCVVARQPQAIITWYKRGGSLPSRHQTHGSHLRLHQMSVADSGEYVCRANN

NIDALEASIVISVSPSAGSPSAPGSSMPIRIESSSSHVAEGETLDLNCVVPGQAHAQVTW
HKRGGSLPSHHQTRGSRLRLHHVSPADSGEYVCRVM
GSSGPLEASVLVTIEASGSSAVHV
PAPGGAPPIRIEPSSSRVAEGQTLDLKCVVPGQAHAQVTWHKRGGNLPARHQVHGPLLRL
NQVSPADSGEYSCQVTGSSGTLEASVLVTI
EPSSPGPIPAPGLAQPIYIEASSSHVTEGQ
TLDLNCVVPGQAHAQVTWYKRGGSLPARHQTHGSQLRLHLVSPADSGEYVCRAASGPGPE
QEASFTVTV
PPSEGSSYRLRSPVISIDPPSSTVQQGQDASFKCLIHDGAAPISLEWKTRN
QELEDNVHISPNGSIITIVGTRPSNHGTYRCVASN
AYGVAQSVVNLSVHGPPTVSVLPEG
PVWVKVGKAVTLECVSAGEPRSSARWTRISSTPAKLEQRTYGLMDSHAVLQISSAKPSDA
GTYVCLAQN
ALGTAQKQVEVIVDTGAMAPGAPQVQAEEAELTVEAGHTATLRCSATGSPA
PTIHWSKLRSPLPWQHRLEGDTLIIPRVAQQDSGQYICNATSPAGHAEATIILHV
ESPPY
ATTVPEHASVQAGETVQLQCLAHGTPPLTFQWSRVGSSLPGRATARNELLHFERAAPEDS
GRYRCRVTNKVGSAEAFAQLLV
QGPPGSLPATSIPAGSTPTVQVTPQLETKSIGASVEFH
CAVPSDRGTQLRWFKEGGQLPPGHSVQDGVLRIQNLDQSCQGTYICQAHGPWGKAQASAQ
LVI
QALPSVLINIRTSVQTVVVGHAVEFECLALGDPKPQVTWSKVGGHLRPGIVQSGGVV
RIAHVELADAGQYRCTATNAAGTTQSHVLLLV
QALPQISMPQEVRVPAGSAAVFPCIASG
YPTPDISWSKLDGSLPPDSRLENNMLMLPSVRPQDAGTYVCTATNRQGKVKAFAHLQV
PE
RVVPYFTQTPYSFLPLPTIKDAYRKFEIKITFRPDSADGMLLYNGQKRVPGSPTNLANRQ
PDFISFGLVGGRPEFRFDAGSGMATIRHPTPLALGHFHTVTLLRSLTQGSLIVGDLAPVN
GTSQGKFQGLDLNEELYLGGYPDYGAIPKAGLSSGFIGCVRELRIQGEEIV
FHDLNLTAH
GISHCPTCRDRPCQNGGQCHDSESSSYVCVCPAGFTGSRCEHSQALHCHPEACGPDATCV
NRPDGRGYTCRCHLGRSGLRCEEGVTVTTPSLSGAGSYLALPALTNTHHELRLDVEFKPL
APDGVLLFSGGKSGPVEDFVSLAMVGGHLEFRYELGSGLAVLRSAEPLALGRWHRVSAER
LNKDGSLRVNGGRPVLRSSPGKSQGLNLHTLLYLGGVEPSVPLSPATNMSAHFRGCVGEV
SVNGKRLD
LTYSFLGSQGIGQCYDSSPCERQPCQHGATCMPAGEYEFQCLCRDGFKGDLC
EHEENPCQLREPCLHGGTCQGTRCLCLPGFSGPRCQQGSGHGIAESDWHLEGSGGNDAPG
QYGAYFHDDGFLAFPGHVFSRSLPEVPETIELEVRTSTASGLLLWQGVEVGEAGQGKDFI
SLGLQDGHLVFRYQLGSGEARLVSEDPINDGEWHRVTALREGRRGSIQVDGEELVSGRSP
GPNVAVNAKGSVYIGGAPDVATLTGGRFSSGITGCVKNLVLHSA
RPGAPPPQPLDLQHRA
QAGANTRPCPS
Sequence length 4391
Interactions View interactions
Associated diseases Disease associations from ClinVar (causal & non-causal) and other databases (OMIM, Orphanet, GWAS, etc.).
56
Evidence Score: ★☆☆☆☆  Gene-disease association found in Text Mining only ★★☆☆☆  Found in Text Mining and Unknown/Other Associations ★★★☆☆  Reported in Unknown/Other Associations across ≥2 Sources ★★★★☆  ClinVar: Pathogenic/Likely Pathogenic (<5 Variants) ★★★★★  ClinVar: Pathogenic/Likely Pathogenic (≥5 Variants)
Causal Diseases associated with Pathogenic or Likely Pathogenic variants in ClinVar
Phenotype Name Clinical Significance dbSNP ID RCV Accession Evidence Score
Abnormal facial shape Likely pathogenic; Pathogenic rs898553156 RCV002226931
★★★★☆
ClinVar: Pathogenic / Likely Pathogenic (<5 Variants)
HSPG2-related disorder Pathogenic; Likely pathogenic rs747493127, rs1318029350 RCV004734269
RCV004734270
★★★★☆
ClinVar: Pathogenic / Likely Pathogenic (<5 Variants)
Lethal Kniest-like syndrome Pathogenic; Likely pathogenic rs111306515, rs769130391, rs2152747090, rs1318029350, rs2152704005, rs1196835948, rs778653296, rs762281715, rs2152721133, rs2550621846, rs1373848573, rs1419062047, rs1294413650 RCV001336825
RCV001726530
RCV001533174
RCV005006043
RCV002249077
View all (8 more)
★★★★★
ClinVar: Pathogenic / Likely Pathogenic (≥5 Variants)
Malignant tumor of urinary bladder Likely pathogenic; Pathogenic rs778653296 RCV005895449
★★★★☆
ClinVar: Pathogenic / Likely Pathogenic (<5 Variants)
Unknown / Other Associations ClinVar entries with uncertain/conflicting evidence, and associations from other databases (OMIM, Orphanet, GWAS, etc.) where the gene is not established as causal.
Phenotype Name Clinical Significance Source Evidence Score
1P36 DELETION SYNDROME Orphanet
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
Acute myeloid leukemia Benign; Likely benign ClinVar
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
Adrenocortical carcinoma, hereditary Benign ClinVar
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
ALZHEIMER DISEASE GWAS catalog
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
Associations from Text Mining Disease associations identified through text mining
Disease Name Disease (Merged) Source PMID Relationship Type Evidence Score
Acquired Kyphoscoliosis Acquired Kyphoscoliosis HPO_DG
★☆☆☆☆
Found in Text Mining only
Acute Erythroblastic Leukemia Erythroblastic Leukemia BEFREE 7544867
★☆☆☆☆
Found in Text Mining only
Acute Promyelocytic Leukemia Promyelocytic Leukemia BEFREE 12050180
★☆☆☆☆
Found in Text Mining only
Adenoid Cystic Carcinoma Adenocarcinoma BEFREE 10993271
★☆☆☆☆
Found in Text Mining only
Adrenoleukodystrophy Adrenoleukodystrophy BEFREE 30426534
★☆☆☆☆
Found in Text Mining only
Adult Oligodendroglioma Oligodendroglioma BEFREE 30402850
★☆☆☆☆
Found in Text Mining only
Age related macular degeneration Age-related macular degeneration BEFREE 28159936
★☆☆☆☆
Found in Text Mining only
Alzheimer Disease Alzheimer disease Pubtator 36928034 Associate
★★☆☆☆
Found in Text Mining + Unknown/Other Associations
Alzheimer`s Disease Alzheimer disease GWASCAT_DG 26830138
★☆☆☆☆
Found in Text Mining only
Ameloblastoma Ameloblastoma BEFREE 12189507, 14720194, 21255062, 29921372
★☆☆☆☆
Found in Text Mining only