6WN7

Homo sapiens S100A5

PDB DOI: https://doi.org/10.2210/pdb6WN7/pdb

Classification: METAL BINDING PROTEIN
Organism(s): Homo sapiens
Expression System: Escherichia coli
Mutation(s): Yes

Deposited: 2020-04-22 Released: 2020-09-30
Deposition Author(s): Perkins, A., Harms, M.J., Wong, C.E., Wheeler, L.C.

Experimental Data Snapshot

Method: X-RAY DIFFRACTION
Resolution: 1.25 Å
R-Value Free: 0.206
R-Value Work: 0.171
R-Value Observed: 0.186
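
The R-values above follow the standard crystallographic residual, R = Σ| |Fobs| − |Fcalc| | / Σ|Fobs|, with R-free evaluated on reflections held out of refinement. The sketch below illustrates that formula and the gap between the two reported values. The function name and the toy amplitudes are hypothetical, and it assumes the calculated amplitudes are already on the same scale as the observed ones; it is not the procedure PHENIX used for this entry.

```python
def r_factor(f_obs, f_calc):
    """Return sum(| |Fobs| - |Fcalc| |) / sum(|Fobs|) for paired amplitudes.

    Assumes f_calc has already been scaled to f_obs (no scale factor is fitted here).
    """
    if len(f_obs) != len(f_calc) or not f_obs:
        raise ValueError("need two non-empty amplitude lists of equal length")
    numerator = sum(abs(abs(o) - abs(c)) for o, c in zip(f_obs, f_calc))
    denominator = sum(abs(o) for o in f_obs)
    if denominator == 0:
        raise ValueError("observed amplitudes sum to zero")
    return numerator / denominator


# Values reported for this entry.
R_WORK = 0.171
R_FREE = 0.206

# Difference between the held-out and working-set residuals.
r_gap = R_FREE - R_WORK

# Toy amplitudes (made up for illustration only).
toy_r = r_factor([10.0, 20.0], [9.0, 22.0])
```

A small gap between R-free and R-work is generally read as a sign that the model is not heavily overfitted to the working set.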

This is version 1.3 of the entry.

Literature

Learning peptide recognition rules for a low-specificity protein.

Wheeler, L.C., Perkins, A., Wong, C.E., Harms, M.J.

(2020) Protein Sci 29: 2259-2273

PubMed: 32979254
DOI: https://doi.org/10.1002/pro.3958
Primary Citation of Related Structures:
6WN7

PubMed Abstract:
Many proteins interact with short linear regions of target proteins. For some proteins, however, it is difficult to identify a well-defined sequence motif that defines its target peptides. To overcome this difficulty, we used supervised machine learning to train a model that treats each peptide as a collection of easily-calculated biochemical features rather than as an amino acid sequence. As a test case, we dissected the peptide-recognition rules for human S100A5 (hA5), a low-specificity calcium binding protein. We trained a Random Forest model against a recently released, high-throughput phage display dataset collected for hA5. The model identifies hydrophobicity and shape complementarity, rather than polar contacts, as the primary determinants of peptide binding specificity in hA5. We tested this hypothesis by solving a crystal structure of hA5 and through computational docking studies of diverse peptides onto hA5. These structural studies revealed that peptides exhibit multiple binding modes at the hA5 peptide interface, all of which have few polar contacts with hA5. Finally, we used our trained model to predict new, plausible binding targets in the human proteome. This revealed a fragment of the protein α-1-syntrophin that binds to hA5. Our work helps better understand the biochemistry and biology of hA5, as well as demonstrating how high-throughput experiments coupled with machine learning of biochemical features can reveal the determinants of binding specificity in low-specificity proteins.
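
To make the idea of treating a peptide as "a collection of easily-calculated biochemical features" concrete, here is a minimal sketch of such a featurization. The feature set (length, mean Kyte-Doolittle hydropathy, a crude net charge, aromatic fraction) and the function name are assumptions chosen for illustration; the abstract does not list the features the authors used, and no model is trained here.

```python
# Kyte-Doolittle hydropathy scale (higher = more hydrophobic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Crude side-chain charges near neutral pH (histidine treated as neutral).
SIDE_CHAIN_CHARGE = {"K": 1, "R": 1, "D": -1, "E": -1}

AROMATIC = frozenset("FWY")


def peptide_features(sequence):
    """Map a peptide sequence to a small dict of sequence-order-independent features."""
    seq = sequence.strip().upper()
    if not seq or any(aa not in KYTE_DOOLITTLE for aa in seq):
        raise ValueError(f"not a plain 20-letter peptide sequence: {sequence!r}")
    n = len(seq)
    return {
        "length": n,
        "mean_hydrophobicity": sum(KYTE_DOOLITTLE[aa] for aa in seq) / n,
        "net_charge": sum(SIDE_CHAIN_CHARGE.get(aa, 0) for aa in seq),
        "aromatic_fraction": sum(aa in AROMATIC for aa in seq) / n,
    }
```

Feature vectors of this kind, paired with binding labels from a screen such as phage display, are the sort of input a Random Forest classifier could be trained on.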

Organizational Affiliation

Institute of Molecular Biology, University of Oregon, Eugene, Oregon, USA.

Macromolecule Content

Total Structure Weight: 66.72 kDa
Atom Count: 4,681
Modelled Residue Count: 536
Deposited Residue Count: 570
Unique protein chains: 1
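
The gap between the deposited and modelled residue counts indicates how much of the crystallized sequence could not be built into the electron density. The short calculation below works that out from the two numbers above; the variable names are illustrative only.

```python
DEPOSITED_RESIDUES = 570  # residues in the deposited sequences
MODELLED_RESIDUES = 536   # residues with coordinates in the model

# Residues present in the sequence but absent from the coordinates.
unmodelled = DEPOSITED_RESIDUES - MODELLED_RESIDUES

# Fraction of the deposited sequence that was actually modelled.
modelled_fraction = MODELLED_RESIDUES / DEPOSITED_RESIDUES

print(f"{unmodelled} residues unmodelled; {modelled_fraction:.1%} of the sequence modelled")
```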

Macromolecules

Entity ID: 1
Molecule	Chains	Sequence Length	Organism	Details
Protein S100-A5	A, B, C, D, E [auth F], F [auth E]	95	Homo sapiens	Mutation(s): 2; Gene Names: S100A5, S100D
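
In the chain column, a bare letter means the PDB-assigned (label) chain ID and the author-assigned ID are the same, while "E [auth F]" means label chain E corresponds to author chain F. The sketch below parses that notation into a label-to-author mapping; the helper names are hypothetical and the pattern covers only the simple forms that appear on this page.

```python
import re

# Matches "A" or "E [auth F]"; the author ID is optional.
_CHAIN_TOKEN = re.compile(r"^(\w+)(?: \[auth (\w+)\])?$")


def parse_chain(token):
    """Return (label_id, author_id) for one chain token."""
    match = _CHAIN_TOKEN.match(token.strip())
    if match is None:
        raise ValueError(f"unrecognized chain token: {token!r}")
    label_id, author_id = match.group(1), match.group(2)
    # With no explicit [auth ...] part, both IDs are the same.
    return label_id, author_id or label_id


def parse_chain_list(text):
    """Parse a comma-separated chain list into {label_id: author_id}."""
    return dict(parse_chain(token) for token in text.split(","))


protein_chains = parse_chain_list("A, B, C, D, E [auth F], F [auth E]")
```

Here label chains E and F map to author chains F and E respectively, which matters when matching this page against the author's numbering in the coordinate file.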
UniProt & NIH Common Fund Data Resources
UniProtKB: P33763 (Homo sapiens)
PHAROS: P33763
GTEx: ENSG00000196420
UniProt Group: P33763

Small Molecules

Ligands: 1 unique
ID	Chains	Name	Formula	InChI Key
CA	G [auth A], H [auth A], I [auth A], J [auth A], K [auth B], L [auth B], M [auth B], N [auth C], O [auth C], P [auth C], Q [auth D], R [auth D], S [auth D], T [auth F], U [auth F], V [auth F], W [auth E], X [auth E]	CALCIUM ION	Ca	BHPQYMZQTOCNFJ-UHFFFAOYSA-N

Experimental Data & Validation

Experimental Data

Space Group: P 3₂

Unit Cell:

Length (Å)	Angle (°)
a = 76.28	α = 90
b = 76.28	β = 90
c = 84.24	γ = 120
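
The unit cell parameters determine the cell volume, and together with the space group and the molecular weight they give a rough Matthews coefficient. The sketch below uses the general triclinic volume formula, which reduces to a·b·c·sin γ for this cell. It assumes that the 66.72 kDa total structure weight reported for this entry corresponds to one asymmetric unit and uses Z = 3, the number of general positions in P 3₂; the names are illustrative.

```python
import math


def cell_volume(a, b, c, alpha, beta, gamma):
    """Unit cell volume in cubic angstroms; lengths in angstroms, angles in degrees."""
    ca, cb, cg = (math.cos(math.radians(x)) for x in (alpha, beta, gamma))
    return a * b * c * math.sqrt(1 - ca * ca - cb * cb - cg * cg + 2 * ca * cb * cg)


# Cell parameters reported for this entry.
volume = cell_volume(76.28, 76.28, 84.24, 90, 90, 120)

# Assumptions: 3 asymmetric units per cell (space group P 3_2) and
# 66.72 kDa per asymmetric unit (the reported total structure weight).
ASYMMETRIC_UNITS_PER_CELL = 3
ASU_WEIGHT_DA = 66.72e3

# Matthews coefficient: crystal volume per dalton of protein.
matthews_coefficient = volume / (ASYMMETRIC_UNITS_PER_CELL * ASU_WEIGHT_DA)

print(f"V = {volume:.0f} cubic angstroms, Vm = {matthews_coefficient:.2f} cubic angstroms per Da")
```

Under these assumptions the coefficient comes out a little above 2 Å³/Da, within the range commonly seen for protein crystals.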

Software Package:

Software Name	Purpose
PHENIX	refinement
MOSFLM	data reduction
PHASER	phasing
SCALA	data scaling

Entry History

Deposition Data

Released Date: 2020-09-30

Revision History

Version 1.0: 2020-09-30
Type: Initial release
Version 1.1: 2020-10-07
Changes: Database references
Version 1.2: 2020-11-11
Changes: Database references
Version 1.3: 2023-10-18
Changes: Data collection, Database references, Refinement description
