Database resources of the National Center for Biotechnology Information

  title={Database resources of the National Center for Biotechnology Information},
  author={David L. Wheeler and Colombe Chappey and Alex E. Lash and Detlef D. Leipe and Thomas L. Madden and Gregory D. Schuler and Tatiana A. Tatusova and Barbara A. Rapp},
  journal={Nucleic Acids Research},
  pages={D39 - D45}
In addition to maintaining the GenBank(R) nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides data retrieval systems and computational resources for the analysis of data in GenBank and other biological data made available through NCBI's website. NCBI resources include Entrez, Entrez Programming Utilities, PubMed, PubMed Central, Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Electronic PCR, OrfFinder, Spidey, RefSeq, UniGene… 

Tables from this paper

MuGeN: Simultaneous Exploration of Multiple Genomes and Computer Analysis Results
This work proposes a software package, MuGeN, built for navigating through multiple annotated genomes, capable of retrieving annotated sequences in several formats, stored in local files, or available in databases over the network.
Semantic web infrastructure for fungal enzyme biotechnologists
Graphia: A platform for the graph-based visualisation and analysis of complex data
Graphia is an open-source platform created for the graph-based analysis of complex data, e.g. transcriptomics, proteomics, genomics data, designed to rapidly visualise very large graphs in 2D or 3D space, providing a wide range of functionality for graph exploration.
The GAAS Metagenomic Tool and Its Estimations of Viral and Microbial Average Genome Size in Four Major Biomes
Genome relative Abundance and Average Size (GAAS), a complete software package that provides improved estimates of community composition and average genome length for metagenomes in both textual and graphical formats, is presented.
Epitope-based universal vaccine for Human T-lymphotropic virus-1 (HTLV-1)
A potential multi epitope vaccine that can raise the desired immune response against HTLV-1 and be useful in developing effective vaccines against Human T-lymphotropic virus is revealed.
Comprehensive in silico analysis and molecular dynamics of the superoxide dismutase 1 (SOD1) variants related to amyotrophic lateral sclerosis
Understanding the effects of SOD1 mutations on protein structure and function facilitates the design of further experiments and provides relevant information on the molecular mechanism of pathology, which may contribute to improvements in existing treatments for ALS.
Correlation between gene expression levels under drought stress and synonymous codon usage in rice plant by in-silico study
It is found that in genes highly expressing under drought 1) GC content was higher, 2) ENC value was lower, 3) the preferred codons of some amino acids changed and 4) the RSCU ratio of GC- end codons relative to AT-end codons for 18 amino acids increased significantly compared with those in other genes.
Metagenomic Characterization of Chesapeake Bay Virioplankton
A metagenomic library was constructed from Chesapeake Bay virioplankton to suggest that dsDNA viruses are likely one of the largest reservoirs of unknown genetic diversity in the biosphere.
Transcriptional program of the endocrine pancreas in mice and humans.
The Endocrine Pancreas Consortium was formed in late 1999 to derive and sequence cDNA libraries enriched for rare transcripts expressed in the mammalian endocrine pancreas and has constructed the corresponding microarray, termed "PancChip 4.0," which contains >9,000 nonredundant elements.
In silico analysis of the V66M variant of human BDNF in psychiatric disorders: An approach to precision medicine
The V66M mutation of BDNF affects the BDNF essential motions, hydrogen-bonding and secondary structure particularly at its pre and pro-domain, which are crucial for its activity and distribution.


dbSNP: the NCBI database of genetic variation
The dbSNP database is a general catalog of genome variation to address the large-scale sampling designs required by association studies, gene mapping and evolutionary biology, and is integrated with other sources of information at NCBI such as GenBank, PubMed, LocusLink and the Human Genome Project data.
Complete genomes in WWW Entrez: data representation and analysis
Flexible web based views, precomputed relationships, and immediate access to analytical tools provide scientists with a portal into the new insights to be gained from completed genome sequences.
BLAST 2 Sequences, a new tool for comparing protein and nucleotide sequences.
'BLAST 2 Sequences', a new BLAST-based tool for aligning two protein or nucleotide sequences, is described, utilizing the BLAST algorithm for pairwise DNA-DNA or protein-protein sequence comparison.
Entrez: molecular biology database and retrieval system.
Data management and analysis for gene expression arrays
An information system to manage and analyse large-scale expression data, ArrayDB, is developed to provide integrated gene expression reports by linking array target sequences with NCBI's Entrez retrieval system, UniGene and KEGG pathway views.
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.
A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original.
SAGEmap: a public gene expression resource.
A public gene expression data repository and online data access and analysis, WWW and FTP sites for serial analysis of gene expression (SAGE) data, and the organization and use of this resource are described.
Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements.
The use of composition-based statistics is particularly beneficial for large-scale automated applications of PSI-BLAST, and the use, for each database sequence, of a position-specific scoring system tuned to that sequence's amino acid composition.
PatternHunter: faster and more sensitive homology search
A new homology search algorithm 'PatternHunter' is presented that uses a novel seed model for increased sensitivity and new hit-processing techniques for significantly increased speed.
Basic local alignment search tool.