8XYU | pdb_00008xyu

De novo designed protein GPX4-3


Experimental Data Snapshot

  • Method: X-RAY DIFFRACTION
  • Resolution: 1.59 Å
  • R-Value Free: 
    0.224 (Depositor), 0.227 (DCC) 
  • R-Value Work: 
    0.199 (Depositor), 0.198 (DCC) 
  • R-Value Observed: 
    0.200 (Depositor) 

Starting Model: in silico
View more details

wwPDB Validation 3D Report Full Report

Validation slider image for 8XYU

This is version 1.2 of the entry. See complete history

Literature

All-Atom Protein Sequence Design Based on Geometric Deep Learning.

Liu, J.Guo, Z.You, H.Zhang, C.Lai, L.

(2024) Angew Chem Int Ed Engl 63: e202411461-e202411461

  • DOI: https://doi.org/10.1002/anie.202411461
  • Primary Citation Related Structures: 
    8XYR, 8XYS, 8XYT, 8XYU, 8XYV, 8XYW

  • PubMed Abstract: 

    Designing sequences for specific protein backbones is a key step in creating new functional proteins. Here, we introduce GeoSeqBuilder, a deep learning framework that integrates protein sequence generation with side chain conformation prediction to produce the complete all-atom structures for designed sequences. GeoSeqBuilder uses spatial geometric features from protein backbones and explicitly includes three-body interactions of neighboring residues. GeoSeqBuilder achieves native residue type recovery rate of 51.6%, comparable to ProteinMPNN and  other leading methods, while accurately predicting side chain conformations. We first used GeoSeqBuilder to design sequences for thioredoxin and a hallucinated three-helical bundle protein. All the 15 tested sequences expressed as soluble monomeric proteins with high thermal stability, and the 2 high-resolution crystal structures solved closely match the designed models. The generated protein sequences exhibit low similarity (minimum 23%) to the original sequences, with significantly altered hydrophobic cores. We further redesigned the hydrophobic core of glutathione peroxidase 4, and 3 of the 5 designs showed improved enzyme activity. Although further testing is needed, the high experimental success rate in our testing demonstrates that GeoSeqBuilder is a powerful tool for designing novel sequences for predefined protein structures with atomic details. GeoSeqBuilder is available at https://github.com/PKUliujl/GeoSeqBuilder.


  • Organizational Affiliation
    • Peking University, Center for Life Sciences, 5 Yiheyuan Road, 100871, Beijing, CHINA.

Macromolecule Content 

  • Total Structure Weight: 20.26 kDa 
  • Atom Count: 1,532 
  • Modeled Residue Count: 169 
  • Deposited Residue Count: 176 
  • Unique protein chains: 1

Macromolecules

Find similar proteins by:|  3D Structure
Entity ID: 1
MoleculeChains  Sequence LengthOrganismDetailsImage
De novo designed GPX4-3176synthetic constructMutation(s): 0 

Experimental Data & Validation

Experimental Data

  • Method: X-RAY DIFFRACTION
  • Resolution: 1.59 Å
  • R-Value Free:  0.224 (Depositor), 0.227 (DCC) 
  • R-Value Work:  0.199 (Depositor), 0.198 (DCC) 
  • R-Value Observed: 0.200 (Depositor) 
Space Group: P 31 2 1
Unit Cell:
Length ( Å )Angle ( ˚ )
a = 61.371α = 90
b = 61.371β = 90
c = 113.205γ = 120
Software Package:
Software NamePurpose
PHENIXrefinement
autoPROCdata reduction
Aimlessdata scaling
PHASERphasing

Structure Validation

View Full Validation Report



Entry History 

& Funding Information

Deposition Data


Funding OrganizationLocationGrant Number
National Natural Science Foundation of China (NSFC)China--

Revision History  (Full details and data files)

  • Version 1.0: 2024-10-02
    Type: Initial release
  • Version 1.1: 2024-12-18
    Changes: Database references, Structure summary
  • Version 1.2: 2025-04-23
    Changes: Database references