7NVY

RNA polymerase II pre-initiation complex with closed promoter DNA in proximal position


Experimental Data Snapshot

  • Method: ELECTRON MICROSCOPY
  • Resolution: 7.30 Å
  • Aggregation State: PARTICLE 
  • Reconstruction Method: SINGLE PARTICLE 

wwPDB Validation   3D Report Full Report


This is version 1.1 of the entry. See complete history


Literature

Structures of mammalian RNA polymerase II pre-initiation complexes.

Aibara, S.Schilbach, S.Cramer, P.

(2021) Nature 594: 124-128

  • DOI: https://doi.org/10.1038/s41586-021-03554-8
  • Primary Citation of Related Structures:  
    7NVR, 7NVS, 7NVT, 7NVU, 7NVV, 7NVW, 7NVX, 7NVY, 7NVZ, 7NW0

  • PubMed Abstract: 

    The initiation of transcription is a focal point for the regulation of gene activity during mammalian cell differentiation and development. To initiate transcription, RNA polymerase II (Pol II) assembles with general transcription factors into a pre-initiation complex (PIC) that opens promoter DNA. Previous work provided the molecular architecture of the yeast 1-9 and human 10,11 PIC and a topological model for DNA opening by the general transcription factor TFIIH 12-14 . Here we report the high-resolution cryo-electron microscopy structure of PIC comprising human general factors and Sus scrofa domesticus Pol II, which is 99.9% identical to human Pol II. We determine the structures of PIC with closed and opened promoter DNA at 2.5-2.8 Å resolution, and resolve the structure of TFIIH at 2.9-4.0 Å resolution. We capture the TFIIH translocase XPB in the pre- and post-translocation states, and show that XPB induces and propagates a DNA twist to initiate the opening of DNA approximately 30 base pairs downstream of the TATA box. We also provide evidence that DNA opening occurs in two steps and leads to the detachment of TFIIH from the core PIC, which may stop DNA twisting and enable RNA chain initiation.


  • Organizational Affiliation

    Department of Molecular Biology, Max Planck Institute for Biophysical Chemistry, Göttingen, Germany.


Macromolecules

Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 1
MoleculeChains Sequence LengthOrganismDetailsImage
TFIIH basal transcription factor complex helicase XPD subunitA [auth 0]760Homo sapiensMutation(s): 0 
Gene Names: ERCC2XPDXPDC
EC: 3.6.4.12
UniProt & NIH Common Fund Data Resources
Find proteins for P18074 (Homo sapiens)
Explore P18074 
Go to UniProtKB:  P18074
PHAROS:  P18074
GTEx:  ENSG00000104884 
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupP18074
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 2
MoleculeChains Sequence LengthOrganismDetailsImage
General transcription factor IIH subunit 1B [auth 1]548Homo sapiensMutation(s): 0 
Gene Names: GTF2H1BTF2
UniProt & NIH Common Fund Data Resources
Find proteins for P32780 (Homo sapiens)
Explore P32780 
Go to UniProtKB:  P32780
PHAROS:  P32780
GTEx:  ENSG00000110768 
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupP32780
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 3
MoleculeChains Sequence LengthOrganismDetailsImage
General transcription factor IIH subunit 4C [auth 2]462Homo sapiensMutation(s): 0 
Gene Names: GTF2H4
UniProt & NIH Common Fund Data Resources
Find proteins for Q92759 (Homo sapiens)
Explore Q92759 
Go to UniProtKB:  Q92759
PHAROS:  Q92759
GTEx:  ENSG00000213780 
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupQ92759
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 4
MoleculeChains Sequence LengthOrganismDetailsImage
CDK-activating kinase assembly factor MAT1D [auth 3]309Homo sapiensMutation(s): 0 
Gene Names: MNAT1CAP35MAT1RNF66
UniProt & NIH Common Fund Data Resources
Find proteins for P51948 (Homo sapiens)
Explore P51948 
Go to UniProtKB:  P51948
PHAROS:  P51948
GTEx:  ENSG00000020426 
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupP51948
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 5
MoleculeChains Sequence LengthOrganismDetailsImage
General transcription factor IIH subunit 3E [auth 4]308Homo sapiensMutation(s): 0 
Gene Names: GTF2H3
UniProt & NIH Common Fund Data Resources
Find proteins for Q13889 (Homo sapiens)
Explore Q13889 
Go to UniProtKB:  Q13889
PHAROS:  Q13889
GTEx:  ENSG00000111358 
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupQ13889
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 6
MoleculeChains Sequence LengthOrganismDetailsImage
General transcription factor IIH subunit 5F [auth 5]71Homo sapiensMutation(s): 0 
Gene Names: GTF2H5C6orf175TTDA
UniProt & NIH Common Fund Data Resources
Find proteins for Q6ZYL4 (Homo sapiens)
Explore Q6ZYL4 
Go to UniProtKB:  Q6ZYL4
PHAROS:  Q6ZYL4
GTEx:  ENSG00000272047 
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupQ6ZYL4
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 7
MoleculeChains Sequence LengthOrganismDetailsImage
General transcription factor IIH subunit 2G [auth 6]395Homo sapiensMutation(s): 0 
Gene Names: GTF2H2BTF2P44
UniProt & NIH Common Fund Data Resources
Find proteins for Q13888 (Homo sapiens)
Explore Q13888 
Go to UniProtKB:  Q13888
GTEx:  ENSG00000145736 
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupQ13888
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 8
MoleculeChains Sequence LengthOrganismDetailsImage
General transcription and DNA repair factor IIH helicase subunit XPBH [auth 7]782Homo sapiensMutation(s): 0 
Gene Names: ERCC3XPBXPBC
EC: 3.6.4.12
UniProt & NIH Common Fund Data Resources
Find proteins for P19447 (Homo sapiens)
Explore P19447 
Go to UniProtKB:  P19447
PHAROS:  P19447
GTEx:  ENSG00000163161 
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupP19447
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 9
MoleculeChains Sequence LengthOrganismDetailsImage
RPB1I [auth A]1,970Sus scrofaMutation(s): 0 
UniProt
Find proteins for A0A7M4DUC2 (Sus scrofa)
Explore A0A7M4DUC2 
Go to UniProtKB:  A0A7M4DUC2
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupA0A7M4DUC2
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 10
MoleculeChains Sequence LengthOrganismDetailsImage
DNA-directed RNA polymerase subunit betaJ [auth B]1,174Sus scrofaMutation(s): 0 
EC: 2.7.7.6
UniProt
Find proteins for I3LGP4 (Sus scrofa)
Explore I3LGP4 
Go to UniProtKB:  I3LGP4
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupI3LGP4
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 11
MoleculeChains Sequence LengthOrganismDetailsImage
DNA-directed RNA polymerase II subunit RPB3K [auth C]275Sus scrofaMutation(s): 0 
UniProt
Find proteins for I3LCH3 (Sus scrofa)
Explore I3LCH3 
Go to UniProtKB:  I3LCH3
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupI3LCH3
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 12
MoleculeChains Sequence LengthOrganismDetailsImage
RPOL4c domain-containing proteinL [auth D]142Sus scrofaMutation(s): 0 
UniProt
Find proteins for A0A287ADR4 (Sus scrofa)
Explore A0A287ADR4 
Go to UniProtKB:  A0A287ADR4
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupA0A287ADR4
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 13
MoleculeChains Sequence LengthOrganismDetailsImage
DNA-directed RNA polymerase II subunit EM [auth E]210Sus scrofaMutation(s): 0 
UniProt
Find proteins for I3LSI7 (Sus scrofa)
Explore I3LSI7 
Go to UniProtKB:  I3LSI7
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupI3LSI7
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 14
MoleculeChains Sequence LengthOrganismDetailsImage
DNA-directed RNA polymerase II subunit FN [auth F]127Sus scrofaMutation(s): 0 
UniProt
Find proteins for A0A4X1VEK9 (Sus scrofa)
Explore A0A4X1VEK9 
Go to UniProtKB:  A0A4X1VEK9
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupA0A4X1VEK9
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 15
MoleculeChains Sequence LengthOrganismDetailsImage
DNA-directed RNA polymerase II subunit RPB7O [auth G]172Sus scrofaMutation(s): 0 
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 16
MoleculeChains Sequence LengthOrganismDetailsImage
DNA-directed RNA polymerases I, II, and III subunit RPABC3P [auth H]150Sus scrofaMutation(s): 0 
UniProt
Find proteins for I3LCB2 (Sus scrofa)
Explore I3LCB2 
Go to UniProtKB:  I3LCB2
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupI3LCB2
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 17
MoleculeChains Sequence LengthOrganismDetailsImage
DNA-directed RNA polymerase II subunit RPB9Q [auth I]125Sus scrofaMutation(s): 0 
UniProt
Find proteins for P60899 (Sus scrofa)
Explore P60899 
Go to UniProtKB:  P60899
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupP60899
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 18
MoleculeChains Sequence LengthOrganismDetailsImage
DNA-directed RNA polymerases I, II, and III subunit RPABC5R [auth J]67Sus scrofaMutation(s): 0 
UniProt
Find proteins for A0A4X1VYD0 (Sus scrofa)
Explore A0A4X1VYD0 
Go to UniProtKB:  A0A4X1VYD0
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupA0A4X1VYD0
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 19
MoleculeChains Sequence LengthOrganismDetailsImage
RNA_pol_L_2 domain-containing proteinS [auth K]117Sus scrofaMutation(s): 0 
UniProt
Find proteins for F1RKE4 (Sus scrofa)
Explore F1RKE4 
Go to UniProtKB:  F1RKE4
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupF1RKE4
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 20
MoleculeChains Sequence LengthOrganismDetailsImage
RNA polymerase II subunit KT [auth L]58Sus scrofaMutation(s): 0 
UniProt
Find proteins for A0A4X1TRS6 (Sus scrofa)
Explore A0A4X1TRS6 
Go to UniProtKB:  A0A4X1TRS6
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupA0A4X1TRS6
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 21
MoleculeChains Sequence LengthOrganismDetailsImage
Transcription initiation factor IIBU [auth M]316Homo sapiensMutation(s): 0 
Gene Names: GTF2BTF2BTFIIB
EC: 2.3.1.48
UniProt & NIH Common Fund Data Resources
Find proteins for Q00403 (Homo sapiens)
Explore Q00403 
Go to UniProtKB:  Q00403
PHAROS:  Q00403
GTEx:  ENSG00000137947 
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupQ00403
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 23
MoleculeChains Sequence LengthOrganismDetailsImage
TATA-box-binding proteinW [auth O]339Homo sapiensMutation(s): 0 
Gene Names: TBPGTF2D1TF2DTFIID
UniProt & NIH Common Fund Data Resources
Find proteins for P20226 (Homo sapiens)
Explore P20226 
Go to UniProtKB:  P20226
PHAROS:  P20226
GTEx:  ENSG00000112592 
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupP20226
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 24
MoleculeChains Sequence LengthOrganismDetailsImage
General transcription factor IIF subunit 1X [auth Q]517Homo sapiensMutation(s): 0 
Gene Names: GTF2F1RAP74
UniProt & NIH Common Fund Data Resources
Find proteins for P35269 (Homo sapiens)
Explore P35269 
Go to UniProtKB:  P35269
GTEx:  ENSG00000125651 
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupP35269
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 25
MoleculeChains Sequence LengthOrganismDetailsImage
General transcription factor IIF subunit 2Y [auth R]249Homo sapiensMutation(s): 0 
Gene Names: GTF2F2RAP30
EC: 3.6.4.12
UniProt & NIH Common Fund Data Resources
Find proteins for P13984 (Homo sapiens)
Explore P13984 
Go to UniProtKB:  P13984
PHAROS:  P13984
GTEx:  ENSG00000188342 
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupP13984
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 27
MoleculeChains Sequence LengthOrganismDetailsImage
Transcription initiation factor IIA subunit 1AA [auth U]376Homo sapiensMutation(s): 0 
Gene Names: GTF2A1TF2A1
UniProt & NIH Common Fund Data Resources
Find proteins for P52655 (Homo sapiens)
Explore P52655 
Go to UniProtKB:  P52655
PHAROS:  P52655
GTEx:  ENSG00000165417 
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupP52655
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 28
MoleculeChains Sequence LengthOrganismDetailsImage
Transcription initiation factor IIA subunit 2BA [auth V]109Homo sapiensMutation(s): 0 
Gene Names: GTF2A2TF2A2
UniProt & NIH Common Fund Data Resources
Find proteins for P52657 (Homo sapiens)
Explore P52657 
Go to UniProtKB:  P52657
PHAROS:  P52657
GTEx:  ENSG00000140307 
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupP52657
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 29
MoleculeChains Sequence LengthOrganismDetailsImage
General transcription factor IIE subunit 1CA [auth W]439Homo sapiensMutation(s): 0 
Gene Names: GTF2E1TF2E1
UniProt & NIH Common Fund Data Resources
Find proteins for P29083 (Homo sapiens)
Explore P29083 
Go to UniProtKB:  P29083
PHAROS:  P29083
GTEx:  ENSG00000153767 
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupP29083
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 30
MoleculeChains Sequence LengthOrganismDetailsImage
Transcription initiation factor IIE subunit betaDA [auth X]291Homo sapiensMutation(s): 0 
Gene Names: GTF2E2TF2E2
UniProt & NIH Common Fund Data Resources
Find proteins for P29084 (Homo sapiens)
Explore P29084 
Go to UniProtKB:  P29084
PHAROS:  P29084
GTEx:  ENSG00000197265 
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupP29084
Sequence Annotations
Expand
  • Reference Sequence

Find similar proteins by:  Sequence   |   3D Structure  

Entity ID: 31
MoleculeChains Sequence LengthOrganismDetailsImage
Unassigned peptide, likely TFIIE-betaEA [auth Y]19Homo sapiensMutation(s): 0 
Sequence Annotations
Expand
  • Reference Sequence

Find similar proteins by:  Sequence   |   3D Structure  

Entity ID: 32
MoleculeChains Sequence LengthOrganismDetailsImage
Unassigned peptide, likely XPBFA [auth Z]8Homo sapiensMutation(s): 0 
Sequence Annotations
Expand
  • Reference Sequence
Find similar nucleic acids by:  (by identity cutoff)  |  3D Structure
Entity ID: 22
MoleculeChains LengthOrganismImage
Non-template DNAV [auth N]106Human mastadenovirus C
Sequence Annotations
Expand
  • Reference Sequence
Find similar nucleic acids by:  (by identity cutoff)  |  3D Structure
Entity ID: 26
MoleculeChains LengthOrganismImage
Template DNAZ [auth T]106Human mastadenovirus C
Sequence Annotations
Expand
  • Reference Sequence
Small Molecules
Ligands 3 Unique
IDChains Name / Formula / InChI Key2D Diagram3D Interactions
SF4
Query on SF4

Download Ideal Coordinates CCD File 
GA [auth 0]IRON/SULFUR CLUSTER
Fe4 S4
LJBDFODJNLIPKO-UHFFFAOYSA-N
ZN
Query on ZN

Download Ideal Coordinates CCD File 
HA [auth 3]
IA [auth 3]
JA [auth 4]
KA [auth 4]
LA [auth 6]
HA [auth 3],
IA [auth 3],
JA [auth 4],
KA [auth 4],
LA [auth 6],
MA [auth 6],
NA [auth 6],
OA [auth A],
PA [auth A],
RA [auth B],
SA [auth C],
TA [auth I],
UA [auth I],
VA [auth J],
WA [auth L],
XA [auth M],
YA [auth W]
ZINC ION
Zn
PTFCDOFLOPIGGS-UHFFFAOYSA-N
MG
Query on MG

Download Ideal Coordinates CCD File 
QA [auth A]MAGNESIUM ION
Mg
JLVVSXFLKOJNIY-UHFFFAOYSA-N
Experimental Data & Validation

Experimental Data

  • Method: ELECTRON MICROSCOPY
  • Resolution: 7.30 Å
  • Aggregation State: PARTICLE 
  • Reconstruction Method: SINGLE PARTICLE 

Structure Validation

View Full Validation Report



Entry History & Funding Information

Deposition Data


Funding OrganizationLocationGrant Number
H2020 Marie Curie Actions of the European CommissionGermany894862
German Research Foundation (DFG)GermanyEXC 2067/1 39072994
German Research Foundation (DFG)GermanySFB860
German Research Foundation (DFG)GermanySPP2191
European Research Council (ERC)Germany882357

Revision History  (Full details and data files)

  • Version 1.0: 2021-05-05
    Type: Initial release
  • Version 1.1: 2021-06-16
    Changes: Database references