4Y7N

The Structure Insight into 5-Carboxycytosine Recognition by RNA Polymerase II during Transcription Elongation.


Experimental Data Snapshot

  • Method: X-RAY DIFFRACTION
  • Resolution: 3.3 Å
  • R-Value Free: 0.250 
  • R-Value Work: 0.207 

wwPDB Validation 3D Report Full Report


This is version 1.3 of the entry. See complete history

Literature

Molecular basis for 5-carboxycytosine recognition by RNA polymerase II elongation complex.

Wang, L.Zhou, Y.Xu, L.Xiao, R.Lu, X.Chen, L.Chong, J.Li, H.He, C.Fu, X.D.Wang, D.

(2015) Nature 523: 621-625

  • DOI: 10.1038/nature14482
  • Primary Citation of Related Structures:  

  • PubMed Abstract: 
  • DNA methylation at selective cytosine residues (5-methylcytosine (5mC)) and their removal by TET-mediated DNA demethylation are critical for setting up pluripotent states in early embryonic development. TET enzymes successively convert 5mC to 5-hydro ...

    DNA methylation at selective cytosine residues (5-methylcytosine (5mC)) and their removal by TET-mediated DNA demethylation are critical for setting up pluripotent states in early embryonic development. TET enzymes successively convert 5mC to 5-hydroxymethylcytosine (5hmC), 5-formylcytosine (5fC), and 5-carboxylcytosine (5caC), with 5fC and 5caC subject to removal by thymine DNA glycosylase (TDG) in conjunction with base excision repair. Early reports indicate that 5fC and 5caC could be stably detected on enhancers, promoters and gene bodies, with distinct effects on gene expression, but the mechanisms have remained elusive. Here we determined the X-ray crystal structure of yeast elongating RNA polymerase II (Pol II) in complex with a DNA template containing oxidized 5mCs, revealing specific hydrogen bonds between the 5-carboxyl group of 5caC and the conserved epi-DNA recognition loop in the polymerase. This causes a positional shift for incoming nucleoside 5'-triphosphate (NTP), thus compromising nucleotide addition. To test the implication of this structural insight in vivo, we determined the global effect of increased 5fC/5caC levels on transcription, finding that such DNA modifications indeed retarded Pol II elongation on gene bodies. These results demonstrate the functional impact of oxidized 5mCs on gene expression and suggest a novel role for Pol II as a specific and direct epigenetic sensor during transcription elongation.


    Organizational Affiliation

    Skaggs School of Pharmacy and Pharmaceutical Sciences, The University of California, San Diego, 9500 Gilman Drive, La Jolla, California 92093, USA.




Macromolecules

Find similar proteins by: Sequence  |  Structure


Entity ID: 1
MoleculeChainsSequence LengthOrganismDetails
DNA-directed RNA polymerase II subunit RPB1
A
1733Saccharomyces cerevisiae (strain ATCC 204508 / S288c)Mutation(s): 0 
Gene Names: RPO21 (RPB1, RPB220, SUA8)
EC: 2.7.7.6
Find proteins for P04050 (Saccharomyces cerevisiae (strain ATCC 204508 / S288c))
Go to UniProtKB:  P04050
Entity ID: 2
MoleculeChainsSequence LengthOrganismDetails
DNA-directed RNA polymerase II subunit RPB2,DNA-directed RNA polymerase II subunit RPB2
B
1224Saccharomyces cerevisiae (strain ATCC 204508 / S288c)Mutation(s): 0 
Gene Names: RPB2 (RPB150, RPO22)
EC: 2.7.7.6
Find proteins for P08518 (Saccharomyces cerevisiae (strain ATCC 204508 / S288c))
Go to UniProtKB:  P08518
Entity ID: 3
MoleculeChainsSequence LengthOrganismDetails
DNA-directed RNA polymerase II subunit RPB3
C
318Saccharomyces cerevisiae (strain ATCC 204508 / S288c)Mutation(s): 0 
Gene Names: RPB3
Find proteins for P16370 (Saccharomyces cerevisiae (strain ATCC 204508 / S288c))
Go to UniProtKB:  P16370
Entity ID: 4
MoleculeChainsSequence LengthOrganismDetails
DNA-directed RNA polymerases I, II, and III subunit RPABC1
E
215Saccharomyces cerevisiae (strain ATCC 204508 / S288c)Mutation(s): 0 
Gene Names: RPB5 (RPA7, RPC9)
Find proteins for P20434 (Saccharomyces cerevisiae (strain ATCC 204508 / S288c))
Go to UniProtKB:  P20434
Entity ID: 5
MoleculeChainsSequence LengthOrganismDetails
DNA-directed RNA polymerases I, II, and III subunit RPABC2
F
155Saccharomyces cerevisiae (strain ATCC 204508 / S288c)Mutation(s): 0 
Gene Names: RPO26 (RPB6)
Find proteins for P20435 (Saccharomyces cerevisiae (strain ATCC 204508 / S288c))
Go to UniProtKB:  P20435
Entity ID: 6
MoleculeChainsSequence LengthOrganismDetails
DNA-directed RNA polymerases I, II, and III subunit RPABC3
H
146Saccharomyces cerevisiae (strain ATCC 204508 / S288c)Mutation(s): 0 
Gene Names: RPB8
Find proteins for P20436 (Saccharomyces cerevisiae (strain ATCC 204508 / S288c))
Go to UniProtKB:  P20436
Entity ID: 7
MoleculeChainsSequence LengthOrganismDetails
DNA-directed RNA polymerase II subunit RPB9
I
122Saccharomyces cerevisiae (strain ATCC 204508 / S288c)Mutation(s): 0 
Gene Names: RPB9
Find proteins for P27999 (Saccharomyces cerevisiae (strain ATCC 204508 / S288c))
Go to UniProtKB:  P27999
Entity ID: 8
MoleculeChainsSequence LengthOrganismDetails
DNA-directed RNA polymerases I, II, and III subunit RPABC5
J
70Saccharomyces cerevisiae (strain ATCC 204508 / S288c)Mutation(s): 0 
Gene Names: RPB10
Find proteins for P22139 (Saccharomyces cerevisiae (strain ATCC 204508 / S288c))
Go to UniProtKB:  P22139
Entity ID: 9
MoleculeChainsSequence LengthOrganismDetails
DNA-directed RNA polymerase II subunit RPB11
K
120Saccharomyces cerevisiae (strain ATCC 204508 / S288c)Mutation(s): 0 
Gene Names: RPB11
Find proteins for P38902 (Saccharomyces cerevisiae (strain ATCC 204508 / S288c))
Go to UniProtKB:  P38902
Entity ID: 10
MoleculeChainsSequence LengthOrganismDetails
DNA-directed RNA polymerases I, II, and III subunit RPABC4
L
70Saccharomyces cerevisiae (strain ATCC 204508 / S288c)Mutation(s): 0 
Gene Names: RPC10 (RPB12)
Find proteins for P40422 (Saccharomyces cerevisiae (strain ATCC 204508 / S288c))
Go to UniProtKB:  P40422
Entity ID: 11
MoleculeChainsLengthOrganism
DNA (29-MER)T29synthetic construct
Entity ID: 12
MoleculeChainsLengthOrganism
DNA (5'-D(*CP*TP*GP*CP*TP*TP*AP*TP*CP*GP*GP*TP*AP*G)-3')N14synthetic construct
Entity ID: 13
MoleculeChainsLengthOrganism
RNA (5'-D(*AP*UP*GP*GP*AP*GP*AP*GP*G)-3')R9synthetic construct
Small Molecules
Ligands 3 Unique
IDChainsName / Formula / InChI Key2D Diagram3D Interactions
ZN
Query on ZN

Download SDF File 
Download CCD File 
A, B, C, I, J, L
ZINC ION
Zn
PTFCDOFLOPIGGS-UHFFFAOYSA-N
 Ligand Interaction
MG
Query on MG

Download SDF File 
Download CCD File 
A
MAGNESIUM ION
Mg
JLVVSXFLKOJNIY-UHFFFAOYSA-N
 Ligand Interaction
G2P
Query on G2P

Download SDF File 
Download CCD File 
T
PHOSPHOMETHYLPHOSPHONIC ACID GUANYLATE ESTER
C11 H18 N5 O13 P3
GXTIEXDFEKYVGY-KQYNXXCUSA-N
 Ligand Interaction
Modified Residues  1 Unique
IDChainsTypeFormula2D DiagramParent
1CC
Query on 1CC
T
DNA LINKINGC10 H14 N3 O9 PDC
Experimental Data & Validation

Experimental Data

  • Method: X-RAY DIFFRACTION
  • Resolution: 3.3 Å
  • R-Value Free: 0.250 
  • R-Value Work: 0.207 
  • Space Group: C 1 2 1
Unit Cell:
Length (Å)Angle (°)
a = 168.181α = 90.00
b = 222.614β = 101.59
c = 192.829γ = 90.00
Software Package:
Software NamePurpose
HKL-2000data collection
PHENIXrefinement
Cootmodel building
PDB_EXTRACTdata extraction
PHASERphasing
SCALEPACKdata scaling
HKL-2000data scaling

Structure Validation

View Full Validation Report or Ramachandran Plots



Entry History & Funding Information

Deposition Data


Funding OrganizationLocationGrant Number
National Institutes of Health/National Institute of General Medical SciencesUnited StatesGM102362
Sidney Kimmel Foundation for Cancer ResearchUnited StatesKimmel Scholars award
Skaggs School of Pharmacy and Pharmaceutical Sciences, UCSDUnited Statesstart-up funds

Revision History 

  • Version 1.0: 2015-07-15
    Type: Initial release
  • Version 1.1: 2015-08-12
    Type: Database references
  • Version 1.2: 2017-09-20
    Type: Author supporting evidence, Database references, Derived calculations, Refinement description
  • Version 1.3: 2017-10-04
    Type: Author supporting evidence