6OD5

Human TCF4 C-terminal bHLH domain in Complex with 12-bp Oligonucleotide Containing E-box Sequence with 5-carboxylcytosines


Experimental Data Snapshot

  • Method: X-RAY DIFFRACTION
  • Resolution: 2.05 Å
  • R-Value Free: 0.280 
  • R-Value Work: 0.234 
  • R-Value Observed: 0.236 

wwPDB Validation   3D Report Full Report


This is version 1.3 of the entry. See complete history


Literature

Structural basis for preferential binding of human TCF4 to DNA containing 5-carboxylcytosine.

Yang, J.Horton, J.R.Li, J.Huang, Y.Zhang, X.Blumenthal, R.M.Cheng, X.

(2019) Nucleic Acids Res 47: 8375-8387

  • DOI: https://doi.org/10.1093/nar/gkz381
  • Primary Citation of Related Structures:  
    6OD3, 6OD4, 6OD5

  • PubMed Abstract: 

    The psychiatric risk-associated transcription factor 4 (TCF4) is linked to schizophrenia. Rare TCF4 coding variants are found in individuals with Pitt-Hopkins syndrome-an intellectual disability and autism spectrum disorder. TCF4 contains a C-terminal basic-helix-loop-helix (bHLH) DNA binding domain which recognizes the enhancer-box (E-box) element 5'-CANNTG-3' (where N = any nucleotide). A subset of the TCF4-occupancy sites have the expanded consensus binding specificity 5'-C(A/G)-CANNTG-3', with an added outer Cp(A/G) dinucleotide; for example in the promoter for CNIH3, a gene involved in opioid dependence. In mammalian genomes, particularly brain, the CpG and CpA dinucleotides can be methylated at the 5-position of cytosine (5mC), and then may undergo successive oxidations to the 5-hydroxymethyl (5hmC), 5-formyl (5fC), and 5-carboxyl (5caC) forms. We find that, in the context of 5'-0CG-1CA-2CG-3TG-3'(where the numbers indicate successive dinucleotides), modification of the central E-box 2CG has very little effect on TCF4 binding, E-box 1CA modification has a negative influence on binding, while modification of the flanking 0CG, particularly carboxylation, has a strong positive impact on TCF4 binding to DNA. Crystallization of TCF4 in complex with unmodified or 5caC-modified oligonucleotides revealed that the basic region of bHLH domain adopts multiple conformations, including an extended loop going through the DNA minor groove, or the N-terminal portion of a long helix binding in the DNA major groove. The different protein conformations enable arginine 576 (R576) to interact, respectively, with a thymine in the minor groove, a phosphate group of DNA backbone, or 5caC in the major groove. The Pitt-Hopkins syndrome mutations affect five arginine residues in the basic region, two of them (R569 and R576) involved in 5caC recognition. Our analyses indicate, and suggest a structural basis for, the preferential recognition of 5caC by a transcription factor centrally important in brain development.


  • Organizational Affiliation

    Department of Molecular and Cellular Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA.


Macromolecules

Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 1
MoleculeChains Sequence LengthOrganismDetailsImage
Transcription factor 4A,
B,
E [auth C],
F [auth D]
62Homo sapiensMutation(s): 0 
Gene Names: TCF4BHLHB19ITF2SEF2
UniProt & NIH Common Fund Data Resources
Find proteins for P15884 (Homo sapiens)
Explore P15884 
Go to UniProtKB:  P15884
PHAROS:  P15884
GTEx:  ENSG00000196628 
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupP15884
Sequence Annotations
Expand
  • Reference Sequence

Find similar nucleic acids by:  Sequence   |   3D Structure  

Entity ID: 2
MoleculeChains LengthOrganismImage
DNA (5'-D(*AP*(1CC)P*GP*CP*AP*CP*GP*TP*GP*(1CC)P*G)-3')C [auth X],
D [auth Y],
G [auth E],
H [auth F]
12Homo sapiens
Sequence Annotations
Expand
  • Reference Sequence
Experimental Data & Validation

Experimental Data

  • Method: X-RAY DIFFRACTION
  • Resolution: 2.05 Å
  • R-Value Free: 0.280 
  • R-Value Work: 0.234 
  • R-Value Observed: 0.236 
  • Space Group: P 1
Unit Cell:
Length ( Å )Angle ( ˚ )
a = 44.677α = 78.63
b = 44.757β = 78.88
c = 54.64γ = 79.27
Software Package:
Software NamePurpose
PHENIXrefinement
HKL-2000data reduction
HKL-2000data scaling
PHENIXphasing

Structure Validation

View Full Validation Report



Entry History & Funding Information

Deposition Data


Funding OrganizationLocationGrant Number
National Institutes of Health/National Institute of General Medical Sciences (NIH/NIGMS)United StatesGM049245-23

Revision History  (Full details and data files)

  • Version 1.0: 2019-05-29
    Type: Initial release
  • Version 1.1: 2019-10-30
    Changes: Data collection, Database references
  • Version 1.2: 2020-01-01
    Changes: Author supporting evidence
  • Version 1.3: 2023-10-11
    Changes: Data collection, Database references, Refinement description