Cleavage and polyadenylation specificity factor subunit 7 - Q8N684 (CPSF7_HUMAN)


Protein Feature View of PDB entries mapped to a UniProtKB sequence  

Component of the cleavage factor Im (CFIm) complex that functions as an activator of the pre-mRNA 3'-end cleavage and polyadenylation processing required for the maturation of pre-mRNA into functional mRNAs (PubMed:8626397, PubMed:17024186, PubMed:29276085). CFIm contributes to the recruitment of multiprotein complexes on specific sequences on the pre-mRNA 3'-end, so called cleavage and polyadenylation signals (pA signals) (PubMed:8626397, PubMed:17024186). Most pre-mRNAs contain multiple pA signals, resulting in alternative cleavage and polyadenylation (APA) producing mRNAs with variable 3'-end formation (PubMed:23187700, PubMed:29276085). The CFIm complex acts as a key regulator of cleavage and polyadenylation site choice during APA through its binding to 5'-UGUA-3' elements localized in the 3'-untranslated region (UTR) for a huge number of pre-mRNAs (PubMed:20695905, PubMed:29276085). CPSF7 activates directly the mRNA 3'-processing machinery (PubMed:29276085). Binds to pA signals in RNA substrates (PubMed:8626397, PubMed:17024186). UniProt
Pathway Maps
      ESCHER  BiGG
Subunit Structure
Component of the cleavage factor Im (CFIm) complex which is a heterotetramer composed of two subunits of NUDT21/CPSF5 and two subunits of CPSF6 or CPSF7 or a heterodimer of CPSF6 and CPSF7 (PubMed:8626397, PubMed:20695905, PubMed:23187700, Ref.22). The cleavage factor Im (CFIm) complex associates with the CPSF and CSTF complexes to promote the assembly of the core mRNA 3'-processing machinery (PubMed:29276085). Interacts with NUDT21/CPSF5 (PubMed:29276085). Interacts (via Arg/Ser-rich domain) with FIP1L1 (preferentially via unphosphorylated form and Arg/Glu/Asp-rich region); this interaction mediates, at least in part, the interaction between the CFIm and CPSF complexes and may be inhibited by CPSF7 hyper-phosphorylation (PubMed:29276085). UniProt
Contains an Arg/Ser-rich domain composed of arginine-serine dipeptide repeats within the C-terminal region that is necessary and sufficient for activating mRNA 3'-processing (PubMed:29276085). UniProt
The Protein Feature View requires a browser that supports SVG (Scalable Vector Graphics). Mouse over tracks and labels for more information.
Data origin/color codes
The vertical color bar on the left side indicates data provenance.
Data in green originates from UniProtKB  
Variation data (sourced from UniProt) shows non-genetic variation from the ExPASy   and dbSNP   websites.
Data in yellow originates from Pfam  , by interacting with the HMMER3 web site  
Data in purple originates from Phosphosite  .
Data in orange originates from the SCOP   (version 1.75) and SCOPe   (version 2.04) classifications.
Data in grey has been calculated using BioJava  . Protein disorder predictions are based on JRONN (Troshin, P. and Barton, G. J. unpublished), a Java implementation of RONN  
  • Red: potentially disorderd region
  • Blue: probably ordered region.
Hydropathy has been calculated using a sliding window of 15 residues and summing up scores from standard hydrophobicity tables.
  • Red: hydrophobic
  • Blue: hydrophilic.
Data in lilac represent the genomic exon structure projected onto the UniProt sequence.
Data in blue originates from PDB
  • Secstruc: Secondary structure projected from representative PDB entries onto the UniProt sequence.
Sequence Mismatches It is now possible to see information about expression tags, cloning artifacts, and many other details related to sequence mismatches.
Icons represent a number of different sequence modifications that can be observed in PDB files. For example the 'T' icon T represents expression tags that have been added to the sequence. The 'E' icon E represents an engineered mutation. However, besides these two, there are many other icons. For more information about the meaning and exact position of a sequence modification, move the cursor over the icon.
Validation Track

For more details on the Validation Track (Structure Summary Page only) see the dedicated help page.

Data in red indicates combined ranges of Homology Models from the SWISS-MODEL Repository  
The PDB to UniProt mapping is based on the data provided by the EBI SIFTS project. See also Velankar et al., Nucleic Acids Research 33, D262-265 (2005).
Organism icons generated by under CC BY. The authors are: Freepik, Icons8, OCHA, Scott de Jonge.

For more details on the Protein Feature view see the dedicated help page.