The European Bioinformatics Institute's data resources: towards systems biology

Cath Brooksbank, Graham Cameron, Janet M. Thornton
Nucleic Acids Research, pages D46-D53
Genomic and post-genomic biological research has provided fine-grain insights into the molecular processes of life, but also threatens to drown biomedical researchers in data. Moreover, as new high-throughput technologies are developed, the types of data that are gathered en masse are diversifying. The need to collect, store and curate all this information in ways that allow its efficient retrieval and exploitation is greater than ever. The European Bioinformatics Institute's (EBI's) databases… 
BioSpider: A Web Server for Automating Metabolome Annotations
BioSpider is essentially an automated report generator designed specifically to tabulate and summarize data on biomolecules, both large and small, and is believed to be a particularly valuable tool for researchers in metabolomics.
Curation of viral genomes: challenges, applications and the way forward
VirGen, a comprehensive viral genome resource that serves as an annotation and analysis pipeline, has been developed for the curation of public domain viral genome data. Conformational and sequential epitopes of known antigenic proteins are predicted using in-house developed algorithms, a step towards reverse vaccinology.
The Online Metabolic and Molecular Bases of Inherited Disease; Chapter 3.1: Metabolism and Metabolic Disease Resources on the Web, Page 1
This chapter reviews some of the key online databases that are dedicated to explaining or displaying up-to-date information on metabolism, metabolic pathways, and metabolic diseases, including (1) metabolic pathway databases, (2) metabolomic databases, (3) genetic and metabolic disease databases, (4) single-nucleotide polymorphism (SNP) and mutation databases, and (5) sequence databases.
Work flows in life science
This thesis is devoted to finding out which problems bioinformaticians experience when using workflow systems and to providing solutions for these problems.
Reconstruction annotation jamborees: a community approach to systems biology
The development of a consensus network reconstruction that is accepted and used by the research community necessitates a collective effort to formalize such networks that are specific to a target organism.
Computing genomic science : bioinformatics and standardisation in proteomics
The research provides valuable insight into the social construction of post-genomic knowledge and adds to the growing literature in the field of science and technology studies (STS) by revealing how socially constructed knowledges are translated and transferred within and between newly created scientific communities.
ChemBank: a small-molecule screening and cheminformatics resource database
The goal of ChemBank is to provide life scientists unfettered access to biomedically relevant data and tools heretofore available primarily in the private sector.
A bioinformatician's view of the metabolome.
  • I. Nobeli, J. Thornton
  • Biology, Chemistry
    BioEssays : news and reviews in molecular, cellular and developmental biology
  • 2006
Modelling of the interactions of metabolites with other entities in the cell, and eventually complete modelling of reaction pathways will be essential for analysis of the experimental data, and prediction of an organism's response to environmental challenges.
STITCH: interaction networks of chemicals and proteins
STITCH (‘search tool for interactions of chemicals’) integrates information about interactions from metabolic pathways, crystal structures, binding experiments and drug–target relationships. Inferred information from phenotypic effects, text mining and chemical structure similarity is used to predict relations between chemicals.


The KEGG resource for deciphering the genome
A knowledge-based approach for network prediction is developed: given a complete set of genes in the genome, it aims to predict the protein interaction networks that are responsible for various cellular processes.
MIPS: analysis and annotation of proteins from whole genomes
The Munich Information Center for Protein Sequences (MIPS at the GSF), Neuherberg, Germany, provides resources related to genome information and develops databases covering computable information such as the basic evolutionary relations among all genes.
NCBI GEO: mining millions of expression profiles—database and tools
Recent database developments that facilitate effective mining and visualization of gene expression data are described, providing features to examine data from both experiment- and gene-centric perspectives using user-friendly Web-based interfaces accessible to those without computational or microarray-related analytical expertise.
Reactome - A Knowledgebase of Biological Pathways
The Reactome data model allows us to represent many diverse processes in the human system, including the pathways of intermediary metabolism, regulatory pathways, and signal transduction, and high-level processes, such as the cell cycle.
The HUPO PSI's Molecular Interaction format—a community standard for the representation of protein interaction data
This work proposes a community standard data model for the representation and exchange of protein interaction data, jointly developed by members of the Proteomics Standards Initiative (PSI) and the Human Proteome Organization (HUPO).
E-MSD: an integrated data resource for bioinformatics
The Macromolecular Structure Database (MSD) group has worked closely with the UniProt group at the EBI to clean up the taxonomy and sequence cross-reference information in the MSD and UniProt databases, and exchange of annotation information has enriched the structural information in the MSD database with annotation from wider sequence-oriented resources.
EnsMart: a generic system for fast and flexible access to biological data.
The EnsMart system, a generic data warehousing solution for fast and flexible querying of large biological data sets and integration with third-party data and tools, has been applied to Ensembl, where it extends its genomic browser capabilities, facilitating rapid retrieval of customized data sets.
The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003
The SWISS-PROT protein knowledgebase connects amino acid sequences with the current knowledge in the Life Sciences, providing an interdisciplinary overview of relevant information by bringing together experimental results, computed features and sometimes even contradictory conclusions.
Integr8 and Genome Reviews: integrated views of complete genomes and proteomes
This analysis focuses on bacterial and archaeal DNA sequences in which annotation has been upgraded through the integration of data from many sources, including the EMBL Nucleotide Sequence Database, the UniProt Knowledgebase, InterPro, CluSTr, GOA and HOGENOM.
BIND--The Biomolecular Interaction Network Database.
The BIND anticipates the coming large influx of interaction information from high-throughput proteomics efforts including detailed information about post-translational modifications from mass spectrometry.