5YEG

Crystal structure of CTCF ZFs4-8-Hs5-1a complex


Experimental Data Snapshot

  • Method: X-RAY DIFFRACTION
  • Resolution: 2.00 Å
  • R-Value Free: 0.204 
  • R-Value Work: 0.186 
  • R-Value Observed: 0.187 

wwPDB Validation   3D Report Full Report


This is version 1.2 of the entry. See complete history


Literature

Molecular mechanism of directional CTCF recognition of a diverse range of genomic sites

Yin, M.Wang, J.Wang, M.Li, X.Zhang, M.Wu, Q.Wang, Y.

(2017) Cell Res 27: 1365-1377

  • DOI: https://doi.org/10.1038/cr.2017.131
  • Primary Citation of Related Structures:  
    5YEF, 5YEG, 5YEH, 5YEL

  • PubMed Abstract: 

    CTCF, a conserved 3D genome architecture protein, determines proper genome-wide chromatin looping interactions through directional binding to specific sequence elements of four modules within numerous CTCF-binding sites (CBSs) by its 11 zinc fingers (ZFs). Here, we report four crystal structures of human CTCF in complex with CBSs of the protocadherin (Pcdh) clusters. We show that directional CTCF binding to cognate CBSs of the Pcdh enhancers and promoters is achieved through inserting its ZF3, ZFs 4-7, and ZFs 9-11 into the major groove along CBSs, resulting in a sequence-specific recognition of module 4, modules 3 and 2, and module 1, respectively; and ZF8 serves as a spacer element for variable distances between modules 1 and 2. In addition, the base contact with the asymmetric "A" in the central position of modules 2-3, is essential for directional recognition of the CBSs with symmetric core sequences but lacking module 1. Furthermore, CTCF tolerates base changes at specific positions within the degenerated CBS sequences, permitting genome-wide CTCF binding to a diverse range of CBSs. Together, these complex structures provide important insights into the molecular mechanisms for the directionality, diversity, flexibility, dynamics, and conservation of multivalent CTCF binding to its cognate sites across the entire human genome.


  • Organizational Affiliation

    Key Laboratory of RNA Biology, CAS Center for Excellence in Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China.


Macromolecules

Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 1
MoleculeChains Sequence LengthOrganismDetailsImage
Transcriptional repressor CTCF142Homo sapiensMutation(s): 0 
Gene Names: CTCF
UniProt & NIH Common Fund Data Resources
Find proteins for P49711 (Homo sapiens)
Explore P49711 
Go to UniProtKB:  P49711
PHAROS:  P49711
GTEx:  ENSG00000102974 
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupP49711
Sequence Annotations
Expand
  • Reference Sequence
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 2
MoleculeChains Sequence LengthOrganismDetailsImage
Transcriptional repressor CTCF141Homo sapiensMutation(s): 0 
Gene Names: CTCF
UniProt & NIH Common Fund Data Resources
Find proteins for P49711 (Homo sapiens)
Explore P49711 
Go to UniProtKB:  P49711
PHAROS:  P49711
GTEx:  ENSG00000102974 
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupP49711
Sequence Annotations
Expand
  • Reference Sequence

Find similar nucleic acids by:  Sequence   |   3D Structure  

Entity ID: 3
MoleculeChains LengthOrganismImage
DNA (5'-D(*TP*CP*GP*CP*CP*CP*TP*CP*TP*GP*CP*TP*GP*GP*TP*TP*AP*AP*AP*G)-3')C [auth D],
E
21synthetic construct
Sequence Annotations
Expand
  • Reference Sequence

Find similar nucleic acids by:  Sequence   |   3D Structure  

Entity ID: 4
MoleculeChains LengthOrganismImage
DNA (5'-D(*AP*CP*TP*TP*TP*AP*AP*CP*CP*AP*GP*CP*AP*GP*AP*GP*GP*GP*CP*G)-3')D [auth C],
F
20synthetic construct
Sequence Annotations
Expand
  • Reference Sequence
Experimental Data & Validation

Experimental Data

  • Method: X-RAY DIFFRACTION
  • Resolution: 2.00 Å
  • R-Value Free: 0.204 
  • R-Value Work: 0.186 
  • R-Value Observed: 0.187 
  • Space Group: P 1
Unit Cell:
Length ( Å )Angle ( ˚ )
a = 46.167α = 78.41
b = 56.502β = 79.31
c = 67.486γ = 78.6
Software Package:
Software NamePurpose
PHENIXrefinement
HKL-3000data reduction
HKL-3000data scaling
PHENIXphasing

Structure Validation

View Full Validation Report



Entry History & Funding Information

Deposition Data


Funding OrganizationLocationGrant Number
National Natural Science Foundation of ChinaChina31630015
National Natural Science Foundation of ChinaChina91440201
National Natural Science Foundation of ChinaChina31571335
National Natural Science Foundation of ChinaChina31400640
National Natural Science Foundation of ChinaChina31630039
National Natural Science Foundation of ChinaChina91640118
National Natural Science Foundation of ChinaChina31470820

Revision History  (Full details and data files)

  • Version 1.0: 2017-11-29
    Type: Initial release
  • Version 1.1: 2019-10-16
    Changes: Data collection, Structure summary
  • Version 1.2: 2024-03-27
    Changes: Data collection, Database references