6UER

Crystal form 2: Structure of TBP bound to C-C mismatch at pH 7


Experimental Data Snapshot

  • Method: X-RAY DIFFRACTION
  • Resolution: 2.50 Å
  • R-Value Free: 0.238 
  • R-Value Work: 0.199 
  • R-Value Observed: 0.202 

wwPDB Validation   3D Report Full Report


This is version 1.4 of the entry. See complete history


Literature

DNA mismatches reveal conformational penalties in protein-DNA recognition.

Afek, A.Shi, H.Rangadurai, A.Sahay, H.Senitzki, A.Xhani, S.Fang, M.Salinas, R.Mielko, Z.Pufall, M.A.Poon, G.M.K.Haran, T.E.Schumacher, M.A.Al-Hashimi, H.M.Gordan, R.

(2020) Nature 587: 291-296

  • DOI: https://doi.org/10.1038/s41586-020-2843-2
  • Primary Citation of Related Structures:  
    6UEO, 6UEP, 6UEQ, 6UER

  • PubMed Abstract: 

    Transcription factors recognize specific genomic sequences to regulate complex gene-expression programs. Although it is well-established that transcription factors bind to specific DNA sequences using a combination of base readout and shape recognition, some fundamental aspects of protein-DNA binding remain poorly understood 1,2 . Many DNA-binding proteins induce changes in the structure of the DNA outside the intrinsic B-DNA envelope. However, how the energetic cost that is associated with distorting the DNA contributes to recognition has proven difficult to study, because the distorted DNA exists in low abundance in the unbound ensemble 3-9 . Here we use a high-throughput assay that we term SaMBA (saturation mismatch-binding assay) to investigate the role of DNA conformational penalties in transcription factor-DNA recognition. In SaMBA, mismatched base pairs are introduced to pre-induce structural distortions in the DNA that are much larger than those induced by changes in the Watson-Crick sequence. Notably, approximately 10% of mismatches increased transcription factor binding, and for each of the 22 transcription factors that were examined, at least one mismatch was found that increased the binding affinity. Mismatches also converted non-specific sites into high-affinity sites, and high-affinity sites into 'super sites' that exhibit stronger affinity than any known canonical binding site. Determination of high-resolution X-ray structures, combined with nuclear magnetic resonance measurements and structural analyses, showed that many of the DNA mismatches that increase binding induce distortions that are similar to those induced by protein binding-thus prepaying some of the energetic cost incurred from deforming the DNA. Our work indicates that conformational penalties are a major determinant of protein-DNA recognition, and reveals mechanisms by which mismatches can recruit transcription factors and thus modulate replication and repair activities in the cell 10,11 .


  • Organizational Affiliation

    Center for Genomic and Computational Biology, Duke University School of Medicine, Durham, NC, USA.


Macromolecules

Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 1
MoleculeChains Sequence LengthOrganismDetailsImage
TATA-box-binding protein 1A,
D [auth B]
219Arabidopsis thalianaMutation(s): 0 
Gene Names: TBP1At3g13445MRP15.10
UniProt
Find proteins for P28147 (Arabidopsis thaliana)
Explore P28147 
Go to UniProtKB:  P28147
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupP28147
Sequence Annotations
Expand
  • Reference Sequence

Find similar nucleic acids by:  Sequence   |   3D Structure  

Entity ID: 2
MoleculeChains LengthOrganismImage
DNA (5'-D(*GP*CP*TP*AP*TP*AP*AP*AP*CP*GP*GP*GP*CP*A)-3')B [auth C],
E
14synthetic construct
Sequence Annotations
Expand
  • Reference Sequence

Find similar nucleic acids by:  Sequence   |   3D Structure  

Entity ID: 3
MoleculeChains LengthOrganismImage
DNA (5'-D(*TP*GP*CP*CP*CP*CP*TP*TP*TP*AP*TP*AP*GP*C)-3')C [auth D],
F
14synthetic construct
Sequence Annotations
Expand
  • Reference Sequence
Experimental Data & Validation

Experimental Data

  • Method: X-RAY DIFFRACTION
  • Resolution: 2.50 Å
  • R-Value Free: 0.238 
  • R-Value Work: 0.199 
  • R-Value Observed: 0.202 
  • Space Group: P 21 21 21
Unit Cell:
Length ( Å )Angle ( ˚ )
a = 88.879α = 90
b = 91.228β = 90
c = 97.58γ = 90
Software Package:
Software NamePurpose
MOSFLMdata reduction
SCALAdata scaling
MOLREPphasing
PHENIXrefinement
PDB_EXTRACTdata extraction

Structure Validation

View Full Validation Report



Entry History & Funding Information

Deposition Data


Funding OrganizationLocationGrant Number
National Institutes of Health/National Institute of General Medical Sciences (NIH/NIGMS)United StatesR35GM130290

Revision History  (Full details and data files)

  • Version 1.0: 2020-09-02
    Type: Initial release
  • Version 1.1: 2020-10-21
    Changes: Database references
  • Version 1.2: 2020-11-04
    Changes: Database references
  • Version 1.3: 2020-11-25
    Changes: Database references
  • Version 1.4: 2023-10-11
    Changes: Data collection, Database references, Refinement description