ENV - Envelope polyprotein
HIV ENV is a polyprotein precursor (gp160 ), which is processed by cellular proteases to give a non-covalent complex of an external glycoprotein (SU / gp120 ) and a transmembrane glycoprotein (TM / gp41 )
[animated gif: 320x240 ]
Description:
HIV ENV is a polyprotein precursor (gp160 ), which is processed by cellular proteases to give a non-covalent complex of an external glycoprotein (SU / gp120 ) and a transmembrane glycoprotein (TM / gp41 ).
Localization:
Cell plasma membrane
Virion envelope
Function:
Mediates virion attachment and entry
Polyprotein precursor for glycoproteins that occur on the surface of virions
Fsorm complexes with CD4 receptors within Endoplasmic Reticulum, decreases their presence on the cell surface
Additional
Information:
translated from singly spliced mRNA
synthesized in the Endoplasmic Reticulum
migrates through the Golgi complex
undergoes glycosylation within Golgi complex
glycosylation consists of 25 to 30 complex N-linked carbohydrate side chains on Asparagine residues
glycosylation is required for infectivity (Ref. #3)
Genomic Location: [TOP ]
Reference
Sequences:
HIV-1
(HXB2):
10 20 30 40 50 60 70 | | | | | | | MRVKEKYQHL WRWGWRWGTM LLGMLMICSA TEKLWVTVYY GVPVWKEATT TLFCASDAKA YDTEVHNVWA 80 90 100 110 120 130 140 | | | | | | | THACVPTDPN PQEVVLVNVT ENFNMWKNDM VEQMHEDIIS LWDQSLKPCV KLTPLCVSLK CTDLKNDTNT 150 160 170 180 190 200 210 | | | | | | | NSSSGRMIME KGEIKNCSFN ISTSIRGKVQ KEYAFFYKLD IIPIDNDTTS YKLTSCNTSV ITQACPKVSF 220 230 240 250 260 270 280 | | | | | | | EPIPIHYCAP AGFAILKCNN KTFNGTGPCT NVSTVQCTHG IRPVVSTQLL LNGSLAEEEV VIRSVNFTDN 290 300 310 320 330 340 350 | | | | | | | AKTIIVQLNT SVEINCTRPN NNTRKRIRIQ RGPGRAFVTI GKIGNMRQAH CNISRAKWNN TLKQIASKLR 360 370 380 390 400 410 420 | | | | | | | EQFGNNKTII FKQSSGGDPE IVTHSFNCGG EFFYCNSTQL FNSTWFNSTW STEGSNNTEG SDTITLPCRI 430 440 450 460 470 480 490 | | | | | | | KQIINMWQKV GKAMYAPPIS GQIRCSSNIT GLLLTRDGGN SNNESEIFRP GGGDMRDNWR SELYKYKVVK 500 510 520 530 540 550 560 | | | | | | | IEPLGVAPTK AKRRVVQREK RAVGIGALFL GFLGAAGSTM GAASMTLTVQ ARQLLSGIVQ QQNNLLRAIE 570 580 590 600 610 620 630 | | | | | | | AQQHLLQLTV WGIKQLQARI LAVERYLKDQ QLLGIWGCSG KLICTTAVPW NASWSNKSLE QIWNHTTWME 640 650 660 670 680 690 700 | | | | | | | WDREINNYTS LIHSLIEESQ NQQEKNEQEL LELDKWASLW NWFNITNWLW YIKLFIMIVG GLVGLRIVFA 710 720 730 740 750 760 770 | | | | | | | VLSIVNRVRQ GYSPLSFQTH LPTPRGPDRP EGIEEEGGER DRDRSIRLVN GSLALIWDDL RSLCLFSYHR 780 790 800 810 820 830 840 | | | | | | | LRDLLLIVTR IVELLGRRGW EALKYWWNLL QYWSQELKNS AVSLLNATAI AVAEGTDRVI EVVQGACRAI 850 | RHIPRRIRQG LERILL [download in fasta format ]
Length:
856 amino acids
Molecular Weight: 97212 Da
Theoretical pI: 9.17
Protein Domains/Folds/Motifs: [TOP ]
InterPro signature for gp41 - IPR000328 InterPro signature for gp120 - IPR000777
Secondary Structure prediction:
Transmembrane Regions - tmhmm:
N-glycosylation:
31 potential sites
NVTE (88 - 91)
NDTN (136 - 139)
NSSS (141 - 144)
NCSF (156 - 159)
NIST (160 - 163)
NDTT (186 - 189)
NTSV (197 - 200)
NKTF (230 - 233)
NGTG (234 - 237)
NVST (241 - 244)
NGSL (262 - 265)
NFTD (276 - 279)
NTSV (289 - 292)
NCTR (295 - 298)
NNTR (301 - 304)
NISR (332 - 335)
NNTL (339 - 342)
NKTI (356 - 359)
NSTQ (386 - 389)
NSTW (392 - 395)
NSTW (397 - 400)
NNTE (406 - 409)
NITG (448 - 451)
NESE (463 - 466)
NASW (611 - 614)
NKSL (616 - 619)
NHTT (624 - 627)
NYTS (637 - 640)
NITN (674 - 677)
NGSL (750 - 753)
NATA (816 - 819)
N-myristoylation:
13 potential sites
GTgpCT (235 - 240)
GNnkTI (354 - 359)
GSnnTE (404 - 409)
GQirCS (441 - 446)
GLllTR (451 - 456)
GGnsNN (458 - 463)
GNsnNE (459 - 464)
GVapTK (495 - 500)
GAagST (524 - 529)
GStmGA (527 - 532)
GIwgCS (594 - 599)
GGlvGL (690 - 695)
GAcrAI (835 - 840)
Amidation:
1 potential site
lGRR (785 - 788)
Protein kinase C:
13 potential sites
TeK (31 - 33)
SlK (115 - 117)
SlK (128 - 130)
SgR (144 - 146)
SiR (164 - 166)
SyK (190 - 192)
TrK (303 - 305)
TlK (341 - 343)
SgK (599 - 601)
SnK (615 - 617)
TpR (723 - 725)
SiR (745 - 747)
TdR (826 - 828)
Casein kinase II:
8 potential sites
SatE (29 - 32)
SlwD (110 - 113)
SlaE (264 - 267)
TsvE (290 - 293)
SggD (365 - 368)
SnnE (461 - 464)
TwmE (627 - 630)
SliE (644 - 647)
Tyrosine kinase:
1 potential site
RrgwEalkY (787 - 795)
cAMP / cGMP kinase:
none
Cell attachment motif:
none
Asp Protease motif:
none
Asp Prot Retro motif:
none
Cysteine-rich Region:
none
Tryptophan-rich Region:
none
Zinc-finger CCHC motif:
none
Leucine Zipper motif:
1 potential site
(793 - 814)
Protein-Protein Interactions: [TOP ]
Host endoproteases (including furin )
cleave HIV Env polyprotein in the highly conserved domain at the
carboxy terminus, consisting of residues: Arg-X-Lys/Arg-Arg (Ref. #4)
Env binds to the molecular chaperone BiP
Primary and Secondary Database Entries: [TOP ]
Identifiers:
ViralZone: HIV-1 PDB/MMDB: Search for HIV-1 & Env
SwissProt: P04578 (HIV-1 HXB2 ENV)
EMBL: K03455; AAB50262.1 [EMBL /GenBank /DDBJ ]
PIR: UNKNOWN
HIV: K03455; ENV$HXB2
InterPro: IPR000328 - gp41 family / IPR000777 - gp120 family
Pfam: PF00517 - gp41 family / PF00517 - gp120 family
Prints: none
ProDom:
PD616511 (3 - 23) /
PD551838 (20 - 194) /
PD597348 (24 - 82) /
PD553839 (82 - 132)
PD000193 (83 - 133) /
PD000124 (83 - 172) /
PD529973 (85 - 132) /
PD591903 (128 - 157)
PD579607 (158 - 193)/
PD499504 (158 - 187) /
PD000123 (178 - 203) /
PD000124 (195 - 492)
PD614419 (330 - 391) /
PD561449 (368 - 491) /
PD527975 (403 - 492) /
PD555302 (486 - 542)
PD598762 (493 - 541)/
PD001599 (508 - 679) /
PD000125 (542 - 591) /
PD605682 (592 - 618)
PD460876 (593 - 640) /
PD628734 (619 - 640)/
PD657031 (620 - 639) /
PD553288 (664 - 715)
PD653931 (670 - 716)/
PD000116 (670 - 716) /
PD000205 (717 - 795) /
PD000745 (796 - 856)
PD000212 (797 - 843)
SCOP: SSF56502 - gp120 core / SSF52440 - preATP-grasp domain / SSF56349 - DNA breaking-rejoining motif
BLOCKS: P04578
Prosite: P04578
ProtoNet: P04578
ProtoMap: P04578
PRESAGE: P04578
Database of Interacting Proteins: P04578
ModBase: P04578
Swiss-2DPAGE: 2D gel
BioAfrica Tools:
- Env Protein Data Mining Tool provides real-time analysis of HIV-1 Env isolates
- HIV Structure BLAST searches for similar HIV sequences that have known structures
- HIV Proteomics Resource contains protein sequence and structure analysis tools
Reviews and References: [TOP ]
1 - HIV Sequence Compendium 2000
Kuiken CL, Foley B, Hahn B, Korber B, Marx PA, McCutchan F, Mellors JW, Mullins JI, Sodroski J, Wolinksy S.
Theoretical Biol. & Biophys. Group, Los Alamos Nat Lab , LA-UR 01-3860 [Read it online: Compendium ]
2 - Retroviruses
Coffin JM, Hughes SH, Varmus HE.
CD-ROM ed. (2002) Cold Spring Harbor Laboratory Press [Read it online: NCBI Bookshelf ]
3 - The CD4-gp120 interaction and AIDS pathogenesis.
Capon DJ, Ward RH.
Annu Rev Immunol. 9: 649-678 (1991) [pubmed: 92000518 ]
4 - Maturation of HIV envelope glycoprotein precursors by cellular endoproteases
Moulard M, Decroly E.
Biochim Biophys Acta 1469(3): 121-132 (2000) [pubmed: 11063880 ]