Gag polyprotein - P05889 (GAG_HV1W2)


Protein Feature View of PDB entries mapped to a UniProtKB sequence  

Mediates, with Gag-Pol polyprotein, the essential events in virion assembly, including binding the plasma membrane, making the protein-protein interactions necessary to create spherical particles, recruiting the viral Env proteins, and packaging the genomic RNA via direct interactions with the RNA packaging sequence (Psi). UniProt
Pathway Maps
      ESCHER  BiGG
Subunit Structure
Gag polyprotein: Homotrimer; further assembles as hexamers of trimers. Oligomerization possibly creates a central hole into which the cytoplasmic tail of the gp41 envelope protein may be inserted.Gag polyprotein: Interacts with host TRIM22; this interaction seems to disrupt proper trafficking of Gag polyprotein and may interfere with budding. Gag polyprotein: Interacts with host PDZD8. Matrix protein p17: Homotrimer; further assembles as hexamers of trimers. Matrix protein p17: Interacts with gp41 (via C-terminus). Matrix protein p17: Interacts with host CALM1; this interaction induces a conformational change in the Matrix protein, triggering exposure of the myristate group. Matrix protein p17: Interacts with host AP3D1; this interaction allows the polyprotein trafficking to multivesicular bodies during virus assembly. Matrix protein p17: Part of the pre-integration complex (PIC) which is composed of viral genome, matrix protein, Vpr and integrase. Capsid protein p24: Homodimer; the homodimer further multimerizes as homohexamers or homopentamers. Capsid protein p24: Interacts with human PPIA/CYPA. Capsid protein p24: Interacts with human NUP153. Capsid protein p24: Interacts with host PDZD8; this interaction stabilizes the capsid. Capsid protein p24: Interacts with monkey TRIM5; this interaction destabilizes the capsid. p6-gag interacts with Vpr; this interaction allows Vpr incorporation into the virion. p6-gag interacts with host TSG101. p6-gag interacts with host PDCD6IP/AIP1. Gag polyprotein: When ubiquitinated, interacts (via p6-gag domain) with host PACSIN2; this interaction allows PACSIN2 recruitment to viral assembly sites and its subsequent incorporation into virions (By similarity). UniProt
Late-budding domains (L domains) are short sequence motifs essential for viral particle budding. They recruit proteins of the host ESCRT machinery (Endosomal Sorting Complex Required for Transport) or ESCRT-associated proteins. p6-gag contains two L domains: a PTAP/PSAP motif, which interacts with the UEV domain of TSG101 and a LYPX(n)L motif which interacts with PDCD6IP/AIP1. UniProt
  • Organism: isolate HXB2
  • Length:
  • UniProt
  • Isoforms: 2 , currently showing only the 'canonical' sequence.
  • Other Gene names: gag
This protein in other organisms (by gene name):
The Protein Feature View requires a browser that supports SVG (Scalable Vector Graphics). Mouse over tracks and labels for more information.
Data origin/color codes
The vertical color bar on the left side indicates data provenance.
Data in green originates from UniProtKB  
Variation data (sourced from UniProt) shows non-genetic variation from the ExPASy   and dbSNP   websites.
Data in yellow originates from Pfam  , by interacting with the HMMER3 web site  
Data in purple originates from Phosphosite  .
Data in orange originates from the SCOP   (version 1.75) and SCOPe   (version 2.04) classifications.
Data in grey has been calculated using BioJava  . Protein disorder predictions are based on JRONN (Troshin, P. and Barton, G. J. unpublished), a Java implementation of RONN  
  • Red: potentially disorderd region
  • Blue: probably ordered region.
Hydropathy has been calculated using a sliding window of 15 residues and summing up scores from standard hydrophobicity tables.
  • Red: hydrophobic
  • Blue: hydrophilic.
Data in lilac represent the genomic exon structure projected onto the UniProt sequence.
Data in blue originates from PDB
  • Secstruc: Secondary structure projected from representative PDB entries onto the UniProt sequence.
Sequence Mismatches It is now possible to see information about expression tags, cloning artifacts, and many other details related to sequence mismatches.
Icons represent a number of different sequence modifications that can be observed in PDB files. For example the 'T' icon T represents expression tags that have been added to the sequence. The 'E' icon E represents an engineered mutation. However, besides these two, there are many other icons. For more information about the meaning and exact position of a sequence modification, move the cursor over the icon.
Validation Track

For more details on the Validation Track (Structure Summary Page only) see the dedicated help page.

Data in red indicates combined ranges of Homology Models from the SWISS-MODEL Repository  
The PDB to UniProt mapping is based on the data provided by the EBI SIFTS project. See also Velankar et al., Nucleic Acids Research 33, D262-265 (2005).
Organism icons generated by under CC BY. The authors are: Freepik, Icons8, OCHA, Scott de Jonge.

For more details on the Protein Feature view see the dedicated help page.