FASTA SEQUENCE FILES ON RCSB PDB FTP ARCHIVES
he RCSB PDB maintains several FASTA formatted sequence files on the
FTP archives. The sequences for all currently released experimental
structures are contained in pdb_seqres.txt, available in uncompressed
form at ftp://ftp.rcsb.org/pub/pdb/derived_data/pdb_seqres.txt and in
Unix compressed (".Z") format at
ftp://ftp.rcsb.org/pub/pdb/derived_data/pdb_seqres.txt.Z. These two
files contain all sequences for structures queried on the Home Page,
QuickSearch, SearchLite, and SearchFields.
PDB depositors are given the opportunity to prerelease the sequences
of their structures before releasing the coordinate data.
Prereleased sequences for unreleased structures are contained in the
separate file pre-released.seq, available in uncompressed form at
structures can be queried on the Status Query page.
BATCH FILE DOWNLOAD SCRIPT NOW AVAILABLE
script to download large numbers of files from the PDB FTP site is
now available at
ftp://ftp.rcsb.org/pub/pdb/software/getPdbStructures.pl. This simple
Perl script can be run locally to download files from a user's list
of PDB IDs. Options are available to download coordinate files in
either PDB or mmCIF format, as well as experimental data files. The
script creates a directory structure for the downloaded
files. Further details regarding usage of this script can be found at
he RCSB PDB is available from several Web and FTP sites located around the
world. Users are also invited to preview the newly reengineered RCSB
web site at www.rcsb.org/pdb.
The access statistics are given below for the primary RCSB PDB website