8SY7

E. coli DNA-directed RNA polymerase transcription elongation complex bound the unnatural dB-STP base pair in the active site


Experimental Data Snapshot

  • Method: ELECTRON MICROSCOPY
  • Resolution: 2.65 Å
  • Aggregation State: PARTICLE 
  • Reconstruction Method: SINGLE PARTICLE 

wwPDB Validation   3D Report Full Report


This is version 1.0 of the entry. See complete history


Literature

A unified Watson-Crick geometry drives transcription of six-letter expanded DNA alphabets by E. coli RNA polymerase.

Oh, J.Shan, Z.Hoshika, S.Xu, J.Chong, J.Benner, S.A.Lyumkis, D.Wang, D.

(2023) Nat Commun 14: 8219-8219

  • DOI: https://doi.org/10.1038/s41467-023-43735-9
  • Primary Citation of Related Structures:  
    8SY5, 8SY6, 8SY7

  • PubMed Abstract: 

    Artificially Expanded Genetic Information Systems (AEGIS) add independently replicable unnatural nucleotide pairs to the natural G:C and A:T/U pairs found in native DNA, joining the unnatural pairs through alternative modes of hydrogen bonding. Whether and how AEGIS pairs are recognized and processed by multi-subunit cellular RNA polymerases (RNAPs) remains unknown. Here, we show that E. coli RNAP selectively recognizes unnatural nucleobases in a six-letter expanded genetic system. High-resolution cryo-EM structures of three RNAP elongation complexes containing template-substrate UBPs reveal the shared principles behind the recognition of AEGIS and natural base pairs. In these structures, RNAPs are captured in an active state, poised to perform the chemistry step. At this point, the unnatural base pair adopts a Watson-Crick geometry, and the trigger loop is folded into an active conformation, indicating that the mechanistic principles underlying recognition and incorporation of natural base pairs also apply to AEGIS unnatural base pairs. These data validate the design philosophy of AEGIS unnatural basepairs. Further, we provide structural evidence supporting a long-standing hypothesis that pair mismatch during transcription occurs via tautomerization. Together, our work highlights the importance of Watson-Crick complementarity underlying the design principles of AEGIS base pair recognition.


  • Organizational Affiliation

    Division of Pharmaceutical Sciences, Skaggs School of Pharmacy & Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA, 92093, USA.


Macromolecules

Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 1
MoleculeChains Sequence LengthOrganismDetailsImage
DNA-directed RNA polymerase subunit alphaA,
B [auth G]
329Escherichia coliMutation(s): 0 
Gene Names: rpoApezphssezb3295JW3257
EC: 2.7.7.6
UniProt
Find proteins for P0A7Z4 (Escherichia coli (strain K12))
Explore P0A7Z4 
Go to UniProtKB:  P0A7Z4
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupP0A7Z4
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 2
MoleculeChains Sequence LengthOrganismDetailsImage
DNA-directed RNA polymerase subunit betaC [auth I]1,342Escherichia coliMutation(s): 0 
Gene Names: rpoBgroNnitBrifronstlstvtabDb3987JW3950
EC: 2.7.7.6
UniProt
Find proteins for P0A8V2 (Escherichia coli (strain K12))
Explore P0A8V2 
Go to UniProtKB:  P0A8V2
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupP0A8V2
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 3
MoleculeChains Sequence LengthOrganismDetailsImage
DNA-directed RNA polymerase subunit beta'D [auth J]1,430Escherichia coliMutation(s): 0 
Gene Names: rpoCtabBb3988JW3951
EC: 2.7.7.6
UniProt
Find proteins for P0A8T7 (Escherichia coli (strain K12))
Explore P0A8T7 
Go to UniProtKB:  P0A8T7
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupP0A8T7
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 4
MoleculeChains Sequence LengthOrganismDetailsImage
DNA-directed RNA polymerase subunit omegaE [auth K]91Escherichia coliMutation(s): 0 
Gene Names: rpoZZ5075ECs4524
EC: 2.7.7.6
UniProt
Find proteins for P0A800 (Escherichia coli (strain K12))
Explore P0A800 
Go to UniProtKB:  P0A800
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupP0A800
Sequence Annotations
Expand
  • Reference Sequence

Find similar nucleic acids by:  Sequence   |   3D Structure  

Entity ID: 5
MoleculeChains LengthOrganismImage
Non-template single stranded DNAF [auth N]18synthetic construct
Sequence Annotations
Expand
  • Reference Sequence

Find similar nucleic acids by:  Sequence   |   3D Structure  

Entity ID: 6
MoleculeChains LengthOrganismImage
RNA oligomerG [auth R]9synthetic construct
Sequence Annotations
Expand
  • Reference Sequence
Find similar nucleic acids by:  (by identity cutoff)  |  3D Structure
Entity ID: 7
MoleculeChains LengthOrganismImage
Template single stranded DNAH [auth T]29synthetic construct
Sequence Annotations
Expand
  • Reference Sequence
Small Molecules
Ligands 3 Unique
IDChains Name / Formula / InChI Key2D Diagram3D Interactions
X0O (Subject of Investigation/LOI)
Query on X0O

Download Ideal Coordinates CCD File 
L [auth J][[(2~{R},3~{S},4~{R},5~{S})-5-(4-azanyl-1-methyl-2-oxidanylidene-pyrimidin-5-yl)-3,4-bis(oxidanyl)oxolan-2-yl]methoxy-oxidanyl-phosphoryl] phosphono hydrogen phosphate
C10 H18 N3 O14 P3
JCPRKVWIOJLDIF-XUTVFYLZSA-N
ZN
Query on ZN

Download Ideal Coordinates CCD File 
I [auth J],
J
ZINC ION
Zn
PTFCDOFLOPIGGS-UHFFFAOYSA-N
MG
Query on MG

Download Ideal Coordinates CCD File 
K [auth J]MAGNESIUM ION
Mg
JLVVSXFLKOJNIY-UHFFFAOYSA-N
Experimental Data & Validation

Experimental Data

  • Method: ELECTRON MICROSCOPY
  • Resolution: 2.65 Å
  • Aggregation State: PARTICLE 
  • Reconstruction Method: SINGLE PARTICLE 
EM Software:
TaskSoftware PackageVersion
MODEL REFINEMENTPHENIX1.19.2_4158:
RECONSTRUCTIONcryoSPARC4.2.1

Structure Validation

View Full Validation Report



Entry History & Funding Information

Deposition Data


Funding OrganizationLocationGrant Number
National Institutes of Health/National Institute of General Medical Sciences (NIH/NIGMS)United StatesGM102362
National Institutes of Health/National Institute of General Medical Sciences (NIH/NIGMS)United StatesGM148476
National Institutes of Health/National Institute Of Allergy and Infectious Diseases (NIH/NIAID)United StatesU54 AI170855
National Science Foundation (NSF, United States)United StatesMCB-2048095
National Institutes of Health/National Cancer Institute (NIH/NCI)United States1R21CA251043-01A1
National Science Foundation (NSF, United States)United StatesMCB- 2123995
National Institutes of Health/National Institute of General Medical Sciences (NIH/NIGMS)United StatesU24 GM129541
National Institutes of Health/National Institute of General Medical Sciences (NIH/NIGMS)United StatesS10 OD032467

Revision History  (Full details and data files)

  • Version 1.0: 2023-12-27
    Type: Initial release