8XYV | pdb_00008xyv

De novo designed protein 0705-5


Experimental Data Snapshot

  • Method: X-RAY DIFFRACTION
  • Resolution: 1.81 Å
  • R-Value Free: 
    0.259 (Depositor), 0.297 (DCC) 
  • R-Value Work: 
    0.216 (Depositor) 
  • R-Value Observed: 
    0.219 (Depositor) 

Starting Model: in silico
View more details

wwPDB Validation 3D Report Full Report

Validation slider image for 8XYV

This is version 1.2 of the entry. See complete history

Literature

All-Atom Protein Sequence Design Based on Geometric Deep Learning.

Liu, J.Guo, Z.You, H.Zhang, C.Lai, L.

(2024) Angew Chem Int Ed Engl 63: e202411461-e202411461

  • DOI: https://doi.org/10.1002/anie.202411461
  • Primary Citation Related Structures: 
    8XYR, 8XYS, 8XYT, 8XYU, 8XYV, 8XYW

  • PubMed Abstract: 

    Designing sequences for specific protein backbones is a key step in creating new functional proteins. Here, we introduce GeoSeqBuilder, a deep learning framework that integrates protein sequence generation with side chain conformation prediction to produce the complete all-atom structures for designed sequences. GeoSeqBuilder uses spatial geometric features from protein backbones and explicitly includes three-body interactions of neighboring residues. GeoSeqBuilder achieves native residue type recovery rate of 51.6%, comparable to ProteinMPNN and  other leading methods, while accurately predicting side chain conformations. We first used GeoSeqBuilder to design sequences for thioredoxin and a hallucinated three-helical bundle protein. All the 15 tested sequences expressed as soluble monomeric proteins with high thermal stability, and the 2 high-resolution crystal structures solved closely match the designed models. The generated protein sequences exhibit low similarity (minimum 23%) to the original sequences, with significantly altered hydrophobic cores. We further redesigned the hydrophobic core of glutathione peroxidase 4, and 3 of the 5 designs showed improved enzyme activity. Although further testing is needed, the high experimental success rate in our testing demonstrates that GeoSeqBuilder is a powerful tool for designing novel sequences for predefined protein structures with atomic details. GeoSeqBuilder is available at https://github.com/PKUliujl/GeoSeqBuilder.


  • Organizational Affiliation
    • Peking University, Center for Life Sciences, 5 Yiheyuan Road, 100871, Beijing, CHINA.

Macromolecule Content 

  • Total Structure Weight: 24.26 kDa 
  • Atom Count: 1,625 
  • Modeled Residue Count: 197 
  • Deposited Residue Count: 218 
  • Unique protein chains: 1

Macromolecules

Find similar proteins by:|  3D Structure
Entity ID: 1
MoleculeChains  Sequence LengthOrganismDetailsImage
De novo designed protein 0705-5
A, B
109synthetic constructMutation(s): 0 

Experimental Data & Validation

Experimental Data

  • Method: X-RAY DIFFRACTION
  • Resolution: 1.81 Å
  • R-Value Free:  0.259 (Depositor), 0.297 (DCC) 
  • R-Value Work:  0.216 (Depositor) 
  • R-Value Observed: 0.219 (Depositor) 
Space Group: P 21 21 21
Unit Cell:
Length ( Å )Angle ( ˚ )
a = 27.99α = 90
b = 65.11β = 90
c = 106.17γ = 90
Software Package:
Software NamePurpose
PHENIXrefinement
XDSdata reduction
APEX 2data scaling
PHASERphasing

Structure Validation

View Full Validation Report



Entry History 

& Funding Information

Deposition Data


Funding OrganizationLocationGrant Number
National Natural Science Foundation of China (NSFC)China--

Revision History  (Full details and data files)

  • Version 1.0: 2024-10-02
    Type: Initial release
  • Version 1.1: 2024-12-18
    Changes: Database references, Structure summary
  • Version 1.2: 2025-04-23
    Changes: Database references