The distribution and query systems of the RCSB Protein Data Bank
@article{Bourne2004TheDA, title={The distribution and query systems of the RCSB Protein Data Bank}, author={Philip E. Bourne and Kenneth J. Addess and Wolfgang Bluhm and Li Chen and Nita Deshpande and Zukang Feng and Ward Fleri and Rachel Kramer Green and Jeffrey C. Merino-Ott and Wayne Townsend-Merino and Helge Weissig and John D. Westbrook and Helen M. Berman}, journal={Nucleic Acids Research}, year={2004}, volume={32 Database issue}, pages={D223-5} }
The Protein Data Bank (PDB; http://www.pdb.org) is the primary source of information on the 3D structure of biological macromolecules. The PDB's mandate is to disseminate this information in the most usable form and as widely as possible. The current query and distribution system is described and an alpha version of the future re-engineered system introduced.
133 Citations
The RCSB Protein Data Bank: a redesigned query system and relational database based on the mmCIF schema
- Biology
- Nucleic Acids Res.
- 2005
The Research Collaboratory for Structural Bioinformatics (RCSB) has completely redesigned its resource for the distribution and query of 3D structure data, expanding the functionality of the existing site by providing structure data in greater detail and uniformity, improved query and enhanced analysis tools.
Using the Tools and Resources of the RCSB Protein Data Bank
- Chemistry
- 2005
The options and procedures for searching and downloading structural data from the PDB, which is maintained by the Research Collaboratory for Structural Bioinformatics (RCSB), are described here along with tools for depositing and assessing the quality of structures.
Using the Tools and Resources of the RCSB Protein Data Bank
- Chemistry
- Current Protocols in Bioinformatics
- 2007
The options and procedures for searching and downloading structural data from the Research Collaboratory for Structural Bioinformatics (RCSB) PDB are described here, along with tools for assessing the quality of structures.
DIAL: a web-based server for the automatic identification of structural domains in proteins
- Mathematics
- Nucleic Acids Res.
- 2005
DIAL is a web server for the automatic identification of structural domains from the 3D coordinates of a protein; it can examine multiple crystallographic chains and provide structural domain solutions that also describe domain swapping events.
The SSEA server for protein secondary structure alignment
- Computer Science, Biology
- Bioinform.
- 2005
A web server that computes alignments of protein secondary structures; it supports both pairwise alignments and searching a secondary structure against a library of domain folds, and can calculate global and local secondary structure element alignments.
Citing a Data Repository: A Case Study of the Protein Data Bank
- Computer Science
- PLoS ONE
- 2015
A novel metric, based on an information cascade constructed by exploring the citation network, is described for measuring influence between competing works and is applied to analyze different practices for citing the PDB.
RCSB Protein Data Bank: Celebrating 50 years of the PDB with new tools for understanding and visualizing biological macromolecules in 3D
- Chemistry
- Protein Science: a publication of the Protein Society
- 2021
The growth and evolution of the archive, as new experimental methods generate ever larger and more complex biomolecular structures, is described, along with the importance of data standards and data remediation in effective management of the archive and facile integration with more than 50 external data resources.
STRIDE: a web server for secondary structure assignment from known atomic coordinates of proteins
- Chemistry
- Nucleic Acids Res.
- 2004
STRIDE is a software tool for secondary structure assignment from atomic resolution protein structures that makes combined use of hydrogen bond energy and statistically derived backbone torsional angle information and is optimized to return resulting assignments in maximal agreement with crystallographers' designations.
iPfam: visualization of protein–protein interactions in PDB at domain and amino acid resolutions
- Computer Science
- Bioinform.
- 2005
A web resource is implemented that allows the investigation of protein interactions in the Protein Data Bank structures at the level of Pfam domains and amino acid residues.
The HHpred interactive server for protein homology detection and structure prediction
- Computer Science, Biology
- Nucleic Acids Res.
- 2005
HHpred is a fast server for remote protein homology detection and structure prediction and is the first to implement pairwise comparison of profile hidden Markov models (HMMs). It allows to search a…
References
The Protein Data Bank and structural genomics
- Chemistry
- Nucleic Acids Res.
- 2003
The Protein Data Bank (PDB; http://www.pdb.org/) continues to be actively involved in various aspects of the informatics of structural genomics projects--developing and maintaining the Target…
The PDB data uniformity project
- Chemistry
- Nucleic Acids Res.
- 2001
The data uniformity project that is underway to address the inconsistency in PDB data is described.
The Protein Data Bank
- Chemistry
- Nucleic Acids Res.
- 2000
The goals of the PDB, the systems in place for data deposition and access, how to obtain further information, and plans for the future development of the resource are described.
The Protein Data Bank: unifying the archive
- Chemistry
- Nucleic Acids Res.
- 2002
Progress has been made in validating all data in the PDB archive and in releasing a uniform archive for the community, and a collection of mmCIF data files for the PDB archive is produced.
Clustering of highly homologous sequences to reduce the size of large protein databases
- Computer Science, Biology
- Bioinform.
- 2001
We present a fast and flexible program for clustering large protein databases at different sequence identity levels. It takes less than 2 h for the all-against-all sequence comparison and clustering…
Protein data representation and query using optimized data decomposition
- Computer Science
- Comput. Appl. Biosci.
- 1997
The initial phase of the work, the data representation and query of all available macromolecular structure data, including real-time access to complex property patterns based on the amino acid sequence, is reported.
The Protein Data Bank. A computer-based archival file for macromolecular structures.
- Chemistry
- European Journal of Biochemistry
- 1977
STING Millennium: a web-based suite of programs for comprehensive and simultaneous analysis of protein structure and sequence
- Biology
- Nucleic Acids Res.
- 2003
Using SMS it is now possible to analyze sequence to structure relationships, the quality of the structure, nature and volume of atomic contacts of intra and inter chain type, relative conservation of amino acids at the specific sequence position based on multiple sequence alignment and indications of folding essential residue (FER) based on the relationship of the residue conservation to the intra-chain contacts.
The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003
- Biology
- Nucleic Acids Res.
- 2003
The SWISS-PROT protein knowledgebase connects amino acid sequences with the current knowledge in the Life Sciences by providing an interdisciplinary overview of relevant information by bringing together experimental results, computed features and sometimes even contradictory conclusions.