The Protein Information Resource (PIR) and the PIR-International Protein Sequence Database

@article{George1997ThePI,
  title={The Protein Information Resource (PIR) and the PIR-International Protein Sequence Database},
  author={David G. George and Robert J. Dodson and John S. Garavelli and Daniel H. Haft and Lois T. Hunt and Christopher R. Marzec and Bruce C. Orcutt and Kathryn E. Sidman and Geetha Y. Srinivasarao and Lai-Su L. Yeh and Leslie Arminski and Robert S. Ledley and Akira Tsugita and Winona C. Barker},
  journal={Nucleic acids research},
  year={1997},
  volume={25 1},
  pages={
          24-8
        }
}
From its origin, the PIR has aspired to support research in computational biology and genomics through the compilation of a comprehensive, quality controlled and well-organized protein sequence information resource. The resource originated with the pioneering work of the late Margaret O. Dayhoff in the early 1960s. Since 1988, the Protein Sequence Database has been maintained collaboratively by PIR-International, an association of macromolecular sequence data collection centers dedicated to… 

Tables from this paper

MIPS: a database for protein sequences, homology data and yeast genome information
The MIPS group (Martinsried Institute for Protein Sequences) at the Max-Planck-Institute for Biochemistry, Martinsried near Munich, Germany, collects, processes and distributes protein sequence data
The GenBank sequence database.
TLDR
GenBank, the National Institutes of Health's genetic sequence database, is an annotated collection of all publicly available nucleotide and protein sequences, which are grouped into divisions; some of these divisions are phylogenetically based, whereas others are based on the tech-nical approach that was used to generate the sequence information.
Proclass protein family database: new version with motif alignments.
  • C. Wu, S. Shivakumar
  • Biology, Computer Science
    Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing
  • 1998
TLDR
The new version of ProClass, a protein family database which organizes non-redundant sequence entries into families defined collectively by the ProSite patterns and PIR superfamilies, is released.
Automated genome sequence analysis and annotation
TLDR
An automatic system for preliminary functional annotation of protein sequences that has been applied to the analysis of sets of sequences from complete genomes, both to refine overall performance and to make new discoveries comparable to those made by human experts is presented.
Integrating and Exploiting Large-Scale, Heterogeneous and Autonomous Databases with an Ontology for Molecular Biology
TLDR
Here, heuristics for building an ontology and the upper level and database branch of a prospective Ontology for Molecular Biology are presented and compared to other ontologies with respect to suitability for integrating a set of heterogeneous and autonomous databases in molecular biology.
An Insight of Biological Databases Used in Bioinformatics
TLDR
This chapter reviews current knowledge of the different types of databases available with examples of their file formats and describes the rationale behind their creation.
BioPD: a web-based information center for bioactive peptides
Bioinformatics: from genome data to biological knowledge.
Current trends in Bioinformatics: An Insight
TLDR
This chapter reviews current knowledge of the different types of databases available with examples of their file formats and describes the rationale behind their creation.
Proteomics and bioinformatics.
...
...

References

SHOWING 1-10 OF 27 REFERENCES
The PIR-International databases
TLDR
The architecture of the Protein Sequence Database, a number of other PIR-International databases, and mechanisms for providing access to and for distribution of these databases are described.
The PIR-International Protein Sequence Database
TLDR
The Protein Information Resource (PIR) supports research on molecular evolution, functional genomics, and computational biology by maintaining a comprehensive, non-redundant, well-organized and freely available protein sequence database.
The European Bioinformatics Institute (EBI) databases
TLDR
The databases and services of the European Bioinformatics Institute (EBI) are described, which maintain and distributes the EMBL Nucleotide Sequence Database, Europe's primary nucleotide sequence data resource.
LISTA, LISTA-HOP and LISTA-HON: a comprehensive compilation of protein encoding sequences and its associated homology databases from the yeast Saccharomyces
TLDR
This release includes reports from full Smith and Watermann peptide-level searches against a non-redundant protein sequence database and new entries from the systematic sequencing efforts.
Biomolecular data : a resource in transition
TLDR
Part 1 The emergence of biomolecular data management: data proliferation - a challenge for science and for CODATA, A.A.Burks and D.Brutlag integrated access to sequence and structural data - principles of design of comprehensive databases for molecular biology.
The Genome Sequence DataBase (GSDB): meeting the challenge of genomic sequencing
The genome sequence database (GSDB) is a complete, publicly available relational database of DNA sequences and annotation maintained by the National Center for Genome Resources (NCGR) under a
Improved tools for biological sequence comparison.
  • W. Pearson, D. Lipman
  • Biology, Computer Science
    Proceedings of the National Academy of Sciences of the United States of America
  • 1988
TLDR
Three computer programs for comparisons of protein and DNA sequences can be used to search sequence data bases, evaluate similarity scores, and identify periodic structures based on local sequence similarity.
...
...