4Y52

Crystal structure of 5-Carboxycytosine Recognition by RNA Polymerase II during Transcription Elongation.


Experimental Data Snapshot

  • Method: X-RAY DIFFRACTION
  • Resolution: 3.50 Å
  • R-Value Free: 0.232 
  • R-Value Work: 0.201 
  • R-Value Observed: 0.203 

wwPDB Validation 3D Report Full Report


This is version 1.5 of the entry. See complete history


Literature

Molecular basis for 5-carboxycytosine recognition by RNA polymerase II elongation complex.

Wang, L.Zhou, Y.Xu, L.Xiao, R.Lu, X.Chen, L.Chong, J.Li, H.He, C.Fu, X.D.Wang, D.

(2015) Nature 523: 621-625

  • DOI: 10.1038/nature14482
  • Structures With Same Primary Citation

  • PubMed Abstract: 
  • DNA methylation at selective cytosine residues (5-methylcytosine (5mC)) and their removal by TET-mediated DNA demethylation are critical for setting up pluripotent states in early embryonic development. TET enzymes successively convert 5mC to 5-hydro ...

    DNA methylation at selective cytosine residues (5-methylcytosine (5mC)) and their removal by TET-mediated DNA demethylation are critical for setting up pluripotent states in early embryonic development. TET enzymes successively convert 5mC to 5-hydroxymethylcytosine (5hmC), 5-formylcytosine (5fC), and 5-carboxylcytosine (5caC), with 5fC and 5caC subject to removal by thymine DNA glycosylase (TDG) in conjunction with base excision repair. Early reports indicate that 5fC and 5caC could be stably detected on enhancers, promoters and gene bodies, with distinct effects on gene expression, but the mechanisms have remained elusive. Here we determined the X-ray crystal structure of yeast elongating RNA polymerase II (Pol II) in complex with a DNA template containing oxidized 5mCs, revealing specific hydrogen bonds between the 5-carboxyl group of 5caC and the conserved epi-DNA recognition loop in the polymerase. This causes a positional shift for incoming nucleoside 5'-triphosphate (NTP), thus compromising nucleotide addition. To test the implication of this structural insight in vivo, we determined the global effect of increased 5fC/5caC levels on transcription, finding that such DNA modifications indeed retarded Pol II elongation on gene bodies. These results demonstrate the functional impact of oxidized 5mCs on gene expression and suggest a novel role for Pol II as a specific and direct epigenetic sensor during transcription elongation.


    Organizational Affiliation

    Skaggs School of Pharmacy and Pharmaceutical Sciences, The University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093, USA.



Macromolecules

Find similar proteins by: Sequence  |  Structure

Entity ID: 4
MoleculeChainsSequence LengthOrganismDetails
DNA-directed RNA polymerases I, II, and III subunit RPABC1E215Saccharomyces cerevisiae S288CMutation(s): 0 
Gene Names: RPB5RPA7RPC9YBR154CYBR1204
Find proteins for P20434 (Saccharomyces cerevisiae (strain ATCC 204508 / S288c))
Explore P20434 
Go to UniProtKB:  P20434
Protein Feature View
 ( Mouse scroll to zoom / Hold left click to move )
  • Reference Sequence

Find similar proteins by: Sequence  |  Structure

Entity ID: 5
MoleculeChainsSequence LengthOrganismDetails
DNA-directed RNA polymerases I, II, and III subunit RPABC2F155Saccharomyces cerevisiae S288CMutation(s): 0 
Gene Names: RPO26RPB6YPR187WP9677.8
Find proteins for P20435 (Saccharomyces cerevisiae (strain ATCC 204508 / S288c))
Explore P20435 
Go to UniProtKB:  P20435
Protein Feature View
 ( Mouse scroll to zoom / Hold left click to move )
  • Reference Sequence

Find similar proteins by: Sequence  |  Structure

Entity ID: 9
MoleculeChainsSequence LengthOrganismDetails
DNA-directed RNA polymerase II subunit RPB11K120Saccharomyces cerevisiae S288CMutation(s): 0 
Gene Names: RPB11YOL005C
Find proteins for P38902 (Saccharomyces cerevisiae (strain ATCC 204508 / S288c))
Explore P38902 
Go to UniProtKB:  P38902
Protein Feature View
 ( Mouse scroll to zoom / Hold left click to move )
  • Reference Sequence

Find similar proteins by: Sequence  |  Structure

Entity ID: 10
MoleculeChainsSequence LengthOrganismDetails
DNA-directed RNA polymerases I, II, and III subunit RPABC4L70Saccharomyces cerevisiae S288CMutation(s): 0 
Gene Names: RPC10RPB12YHR143W-AYHR143BW
Find proteins for P40422 (Saccharomyces cerevisiae (strain ATCC 204508 / S288c))
Explore P40422 
Go to UniProtKB:  P40422
Protein Feature View
 ( Mouse scroll to zoom / Hold left click to move )
  • Reference Sequence

Find similar proteins by: Sequence  |  Structure

Entity ID: 1
MoleculeChainsSequence LengthOrganismDetails
DNA-directed RNA polymerase II subunit RPB1A1733Saccharomyces cerevisiae S288CMutation(s): 0 
Gene Names: RPO21RPB1RPB220SUA8YDL140CD2150
EC: 2.7.7.6
Find proteins for P04050 (Saccharomyces cerevisiae (strain ATCC 204508 / S288c))
Explore P04050 
Go to UniProtKB:  P04050
Protein Feature View
 ( Mouse scroll to zoom / Hold left click to move )
  • Reference Sequence

Find similar proteins by: Sequence  |  Structure

Entity ID: 2
MoleculeChainsSequence LengthOrganismDetails
DNA-directed RNA polymerase II subunit RPB2B1224Saccharomyces cerevisiae S288CMutation(s): 0 
Gene Names: RPB2RPB150RPO22YOR151C
EC: 2.7.7.6
Find proteins for P08518 (Saccharomyces cerevisiae (strain ATCC 204508 / S288c))
Explore P08518 
Go to UniProtKB:  P08518
Protein Feature View
 ( Mouse scroll to zoom / Hold left click to move )
  • Reference Sequence

Find similar proteins by: Sequence  |  Structure

Entity ID: 3
MoleculeChainsSequence LengthOrganismDetails
DNA-directed RNA polymerase II subunit RPB3C318Saccharomyces cerevisiae S288CMutation(s): 0 
Gene Names: RPB3YIL021W
Find proteins for P16370 (Saccharomyces cerevisiae (strain ATCC 204508 / S288c))
Explore P16370 
Go to UniProtKB:  P16370
Protein Feature View
 ( Mouse scroll to zoom / Hold left click to move )
  • Reference Sequence

Find similar proteins by: Sequence  |  Structure

Entity ID: 6
MoleculeChainsSequence LengthOrganismDetails
DNA-directed RNA polymerases I, II, and III subunit RPABC3H146Saccharomyces cerevisiae S288CMutation(s): 0 
Gene Names: RPB8YOR224CYOR50-14
Find proteins for P20436 (Saccharomyces cerevisiae (strain ATCC 204508 / S288c))
Explore P20436 
Go to UniProtKB:  P20436
Protein Feature View
 ( Mouse scroll to zoom / Hold left click to move )
  • Reference Sequence

Find similar proteins by: Sequence  |  Structure

Entity ID: 7
MoleculeChainsSequence LengthOrganismDetails
DNA-directed RNA polymerase II subunit RPB9I122Saccharomyces cerevisiae S288CMutation(s): 0 
Gene Names: RPB9YGL070C
Find proteins for P27999 (Saccharomyces cerevisiae (strain ATCC 204508 / S288c))
Explore P27999 
Go to UniProtKB:  P27999
Protein Feature View
 ( Mouse scroll to zoom / Hold left click to move )
  • Reference Sequence

Find similar proteins by: Sequence  |  Structure

Entity ID: 8
MoleculeChainsSequence LengthOrganismDetails
DNA-directed RNA polymerases I, II, and III subunit RPABC5J70Saccharomyces cerevisiae S288CMutation(s): 0 
Gene Names: RPB10YOR210W
Find proteins for P22139 (Saccharomyces cerevisiae (strain ATCC 204508 / S288c))
Explore P22139 
Go to UniProtKB:  P22139
Protein Feature View
 ( Mouse scroll to zoom / Hold left click to move )
  • Reference Sequence

Find similar nucleic acids by: Sequence  |  Structure

Entity ID: 11
MoleculeChainsLengthOrganism
DNA (29-MER)T29synthetic construct
  • Find similar nucleic acids by: Sequence   |   Structure
Entity ID: 12
MoleculeChainsLengthOrganism
DNA (5'-D(*CP*TP*GP*CP*TP*TP*AP*TP*CP*GP*GP*TP*AP*G)-3')N14synthetic construct
  • Find similar nucleic acids by: Sequence   |   Structure
Entity ID: 13
MoleculeChainsLengthOrganism
RNA (5'-D(*AP*UP*GP*GP*AP*GP*AP*GP*G)-3')R9synthetic construct
Small Molecules
Ligands 2 Unique
IDChainsName / Formula / InChI Key2D Diagram3D Interactions
ZN
Query on ZN

Download CCD File 
A, B, C, I, J, L
ZINC ION
Zn
PTFCDOFLOPIGGS-UHFFFAOYSA-N
 Ligand Interaction
MG
Query on MG

Download CCD File 
A
MAGNESIUM ION
Mg
JLVVSXFLKOJNIY-UHFFFAOYSA-N
 Ligand Interaction
Experimental Data & Validation

Experimental Data

  • Method: X-RAY DIFFRACTION
  • Resolution: 3.50 Å
  • R-Value Free: 0.232 
  • R-Value Work: 0.201 
  • R-Value Observed: 0.203 
  • Space Group: C 1 2 1
Unit Cell:
Length ( Å )Angle ( ˚ )
a = 166.703α = 90
b = 221.644β = 100.35
c = 192.41γ = 90
Software Package:
Software NamePurpose
PHENIXrefinement
DENZOdata reduction
SCALEPACKdata scaling
PHASERphasing
PDB_EXTRACTdata extraction
Cootmodel building
REFMACrefinement

Structure Validation

View Full Validation Report



Entry History & Funding Information

Deposition Data


Funding OrganizationLocationGrant Number
National Institutes of Health/National Institute of General Medical Sciences (NIH/NIGMS)United StatesGM102362
Sidney Kimmel Foundation for Cancer ResearchUnited StatesKimmel Scholars award
Skaggs School of Pharmacy and Pharmaceutical Sciences, UCSDUnited Statesstart-up funds

Revision History 

  • Version 1.0: 2015-07-15
    Type: Initial release
  • Version 1.1: 2015-08-12
    Changes: Database references
  • Version 1.2: 2016-06-15
    Changes: Structure summary
  • Version 1.3: 2017-09-20
    Changes: Author supporting evidence, Derived calculations, Refinement description
  • Version 1.4: 2017-10-04
    Changes: Author supporting evidence
  • Version 1.5: 2019-12-25
    Changes: Author supporting evidence