7S03

DNA-binding domain of human SETMAR in complex with Hsmar1 terminal inverted repeat (TIR) DNA


Experimental Data Snapshot

  • Method: X-RAY DIFFRACTION
  • Resolution: 2.37 Å
  • R-Value Free: 0.235 
  • R-Value Work: 0.211 
  • R-Value Observed: 0.212 

wwPDB Validation   3D Report Full Report


This is version 1.0 of the entry. See complete history


Literature

Structural and genome-wide analyses suggest that transposon-derived protein SETMAR alters transcription and splicing.

Chen, Q.Bates, A.M.Hanquier, J.N.Simpson, E.Rusch, D.B.Podicheti, R.Liu, Y.Wek, R.C.Cornett, E.M.Georgiadis, M.M.

(2022) J Biol Chem 298: 101894-101894

  • DOI: https://doi.org/10.1016/j.jbc.2022.101894
  • Primary Citation of Related Structures:  
    7S03

  • PubMed Abstract: 

    Extensive portions of the human genome have unknown function, including those derived from transposable elements. One such element, the DNA transposon Hsmar1, entered the primate lineage approximately 50 million years ago leaving behind terminal inverted repeat (TIR) sequences and a single intact copy of the Hsmar1 transposase, which retains its ancestral TIR-DNA-binding activity, and is fused with a lysine methyltransferase SET domain to constitute the chimeric SETMAR gene. Here, we provide a structural basis for recognition of TIRs by SETMAR and investigate the function of SETMAR through genome-wide approaches. As elucidated in our 2.37 Å crystal structure, SETMAR forms a dimeric complex with each DNA-binding domain bound specifically to TIR-DNA through the formation of 32 hydrogen bonds. We found that SETMAR recognizes primarily TIR sequences (∼5000 sites) within the human genome as assessed by chromatin immunoprecipitation sequencing analysis. In two SETMAR KO cell lines, we identified 163 shared differentially expressed genes and 233 shared alternative splicing events. Among these genes are several pre-mRNA-splicing factors, transcription factors, and genes associated with neuronal function, and one alternatively spliced primate-specific gene, TMEM14B, which has been identified as a marker for neocortex expansion associated with brain evolution. Taken together, our results suggest a model in which SETMAR impacts differential expression and alternative splicing of genes associated with transcription and neuronal function, potentially through both its TIR-specific DNA-binding and lysine methyltransferase activities, consistent with a role for SETMAR in simian primate development.


  • Organizational Affiliation

    Department of Biochemistry and Molecular Biology, Indiana University School of Medicine, Indianapolis, Indiana, USA.


Macromolecules

Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 1
MoleculeChains Sequence LengthOrganismDetailsImage
Histone-lysine N-methyltransferase SETMAR113Homo sapiensMutation(s): 3 
Gene Names: SETMAR
EC: 2.1.1.357 (PDB Primary Data), 3.1 (PDB Primary Data)
UniProt & NIH Common Fund Data Resources
Find proteins for Q53H47 (Homo sapiens)
Explore Q53H47 
Go to UniProtKB:  Q53H47
PHAROS:  Q53H47
GTEx:  ENSG00000170364 
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupQ53H47
Sequence Annotations
Expand
  • Reference Sequence
Find similar nucleic acids by:  (by identity cutoff)  |  3D Structure
Entity ID: 2
MoleculeChains LengthOrganismImage
Hsmar1 terminal inverted repeats26Homo sapiens
Sequence Annotations
Expand
  • Reference Sequence
Find similar nucleic acids by:  (by identity cutoff)  |  3D Structure
Entity ID: 3
MoleculeChains LengthOrganismImage
Hsmar1 terminal inverted repeats26Homo sapiens
Sequence Annotations
Expand
  • Reference Sequence
Small Molecules
Modified Residues  1 Unique
IDChains TypeFormula2D DiagramParent
MSE
Query on MSE
A
L-PEPTIDE LINKINGC5 H11 N O2 SeMET
Experimental Data & Validation

Experimental Data

  • Method: X-RAY DIFFRACTION
  • Resolution: 2.37 Å
  • R-Value Free: 0.235 
  • R-Value Work: 0.211 
  • R-Value Observed: 0.212 
  • Space Group: C 2 2 21
Unit Cell:
Length ( Å )Angle ( ˚ )
a = 70.978α = 90
b = 166.172β = 90
c = 66.052γ = 90
Software Package:
Software NamePurpose
BUSTERrefinement
XDSdata reduction
Aimlessdata scaling
PHENIXphasing

Structure Validation

View Full Validation Report



Entry History & Funding Information

Deposition Data


Funding OrganizationLocationGrant Number
National Institutes of Health/National Cancer Institute (NIH/NCI)United StatesR01 CA151367

Revision History  (Full details and data files)

  • Version 1.0: 2022-08-03
    Type: Initial release