Time-stamped yearly snapshots of the PDB archive (starting with 2004)
are available via FTP at
It is hoped that these snapshots will provide readily identifiable data
sets for research on the PDB archive. These snapshots include the
coordinate data in multiple formats and the experimental data.
The script at
may be used to make a local copy of a snapshot or sections of the
snapshot. The time required to download the entire archive can take
quite a while (18+ hours); however, the time required to download only
the coordinate data in a single format should be much less. While the
time will depend on the speed of your network, our tests show that all
of the coordinate files in PDB format from a snapshot can be downloaded
approximately 3 hours.
The archival snapshots follow the historical directory
structure--coordinate files are contained in subdirectories named after
the two middle characters of the PDB ID (for example, 100d is found in
the directory '00'). The date and time stamp of each file indicates the
last time the file was modified.
The main PDB archive is updated weekly at
Information on obtaining and maintaining copies of the entire PDB
archive or certain portions of it is available from this README file in
the FTP archive.
users will find download information for downloading from the current
The RCSB PDB (citation) is managed by two members of the Research Collaboratory for Structural Bioinformatics:
RCSB PDB is a member of the
The RCSB PDB is funded by a grant (DBI-1338415) from the
National Science Foundation, the
National Institutes of Health, and the
US Department of Energy.