CRISPR-associated endoribonuclease Cas2 - P45956 (CAS2_ECOLI)


Protein Feature View of PDB entries mapped to a UniProtKB sequence  

  • Number of PDB entries for P45956: 14
CRISPR (clustered regularly interspaced short palindromic repeat), is an adaptive immune system that provides protection against mobile genetic elements (viruses, transposable elements and conjugative plasmids) (PubMed:21255106, PubMed:24920831, PubMed:24793649). CRISPR clusters contain sequences complementary to antecedent mobile elements and target invading nucleic acids. CRISPR clusters are transcribed and processed into CRISPR RNA (crRNA). The Cas1-Cas2 complex is involved in CRISPR adaptation, the first stage of CRISPR immunity, being required for the addition/removal of CRISPR spacers at the leader end of the CRISPR locus (PubMed:24920831, PubMed:25707795, PubMed:24793649). The Cas1-Cas2 complex introduces staggered nicks into both strands of the CRISPR array near the leader repeat and joins the 5'-ends of the repeat strands with the 3'-ends of the new spacer sequence (PubMed:24920831). Spacer DNA integration requires supercoiled target DNA and 3'-OH ends on the inserted (spacer) DNA and probably initiates with a nucleophilic attack of the C 3'-OH end of the protospacer on the minus strand of the first repeat sequence (PubMed:25707795). Expression of Cas1-Cas2 in a strain lacking both genes permits spacer acquisition (PubMed:24793649, PubMed:24920831). Cas2 not seen to bind DNA alone; the Cas1-Cas2 complex preferentially binds CRISPR-locus DNA (PubMed:24793649). Highest binding is seen to a dual forked DNA complex with 3'-overhangs and a protospacer-adjacent motif-complement specifically positioned (PubMed:26478180). The protospacer DNA lies across a flat surface extending from 1 Cas1 dimer, across the Cas2 dimer and contacting the other Cas1 dimer; the 23 bp-long ds section of the DNA is bracketed by 1 Tyr-22 from each of the Cas1 dimers (PubMed:26478180, PubMed:26503043). Cas1 cuts within the 3'-overhang, to generate a 33-nucleotide DNA that is probably incorporated into the CRISPR leader by a cut-and-paste mechanism (PubMed:26478180). This subunit's probable nuclease activity is not required for spacer acquisition (PubMed:24793649). UniProt
Pathway Maps
      ESCHER  BiGG
Subunit Structure
Homodimer (Ref.10). Part of the Cas1-Cas2 complex (PubMed:24920831, PubMed:24793649, PubMed:25707795, Ref.12, PubMed:26478180, PubMed:26503043). Forms a hexamer with 2 Cas1 dimers sandwiching a Cas2 dimer (PubMed:24793649). The DNA lies across a flat surface extending from 1 Cas1 dimer, across the Cas2 dimer and contacting the other Cas1 dimer. Only 1 Cas1 protein from each dimer is catalytic, the other interacts with the Cas2 dimer and possibly target DNA (PubMed:26478180, PubMed:26503043). UniProt
Substrate DNA-binding induces large structural changes that generate a surface for DNA-binding across the Cas2 dimer and formation of an optimal catalytic site (PubMed:26478180). UniProt
  • Other Gene names: ygbF, cas2, b2754, JW5438
This protein in other organisms (by gene name):
The Protein Feature View requires a browser that supports SVG (Scalable Vector Graphics). Mouse over tracks and labels for more information.
Data origin/color codes
The vertical color bar on the left side indicates data provenance.
Data in green originates from UniProtKB  
Variation data (sourced from UniProt) shows non-genetic variation from the ExPASy   and dbSNP   websites.
Data in yellow originates from Pfam  , by interacting with the HMMER3 web site  
Data in purple originates from Phosphosite  .
Data in orange originates from the SCOP   (version 1.75) and SCOPe   (version 2.04) classifications.
Data in grey has been calculated using BioJava  . Protein disorder predictions are based on JRONN (Troshin, P. and Barton, G. J. unpublished), a Java implementation of RONN  
  • Red: potentially disorderd region
  • Blue: probably ordered region.
Hydropathy has been calculated using a sliding window of 15 residues and summing up scores from standard hydrophobicity tables.
  • Red: hydrophobic
  • Blue: hydrophilic.
Data in lilac represent the genomic exon structure projected onto the UniProt sequence.
Data in blue originates from PDB
  • Secstruc: Secondary structure projected from representative PDB entries onto the UniProt sequence.
Sequence Mismatches It is now possible to see information about expression tags, cloning artifacts, and many other details related to sequence mismatches.
Icons represent a number of different sequence modifications that can be observed in PDB files. For example the 'T' icon T represents expression tags that have been added to the sequence. The 'E' icon E represents an engineered mutation. However, besides these two, there are many other icons. For more information about the meaning and exact position of a sequence modification, move the cursor over the icon.
Validation Track

For more details on the Validation Track (Structure Summary Page only) see the dedicated help page.

Data in red indicates combined ranges of Homology Models from SBKB   and the Protein Model Portal  
The PDB to UniProt mapping is based on the data provided by the EBI SIFTS project. See also Velankar et al., Nucleic Acids Research 33, D262-265 (2005).
Organism icons generated by under CC BY. The authors are: Freepik, Icons8, OCHA, Scott de Jonge.

For more details on the Protein Feature view see the dedicated help page.