Databases for Microbiologists

  I. Zhulin
  Published 26 May 2015
  • Biology
  Journal of Bacteriology
ABSTRACT Databases play an increasingly important role in biology. They archive, store, maintain, and share information on genes, genomes, expression data, protein sequences and structures, metabolites and reactions, interactions, and pathways. All these data are critically important to microbiologists. Furthermore, microbiology has its own databases that deal with model microorganisms, microbial diversity, physiology, and pathogenesis. Thousands of biological databases are currently available… 
Bioinformatics Tools for Microbial Diversity Analysis
An overview is given of genomic tools that have aided in identifying isolates, species, and subspecies of uncultured microorganisms and in inferring their functional roles, in light of the emergence of next-generation sequencing (NGS) technologies.
The Microbe Directory v2.0: An Expanded Database of Ecological and Phenotypical Features of Microbes
This update adds 68,852 taxa, many new annotation features, and an interface for the statistical analysis of microbiomes based on TMD features, and it presents a portal for the broad community to add or correct entries.
A review of information resources on antimicrobial resistance genes
The article describes information resources containing data on antimicrobial resistance genes, which increasingly aim to present the full range of information on genes conferring resistance to antimicrobial medicines and chemical compounds.
Functional Genomics Platform, A Cloud-Based Platform for Studying Microbial Life at Scale
The Functional Genomics Platform is described, a comprehensive database relating genotype to phenotype for bacterial life and all of the many-to-many connections between each biological entity including the originating genome, gene, protein, and protein domain.
SynBioStrainFinder: A microbial strain database of manually curated CRISPR/Cas genetic manipulation system information for biomanufacturing
SynBioStrainFinder is the first microbial strain database with manually curated information on the strain CRISPR/Cas system as well as other microbial strain information that provides reference information for the construction of new CRISPR/Cas systems.
The Majority of Active Rhodobacteraceae in Marine Sediments Belong to Uncultured Genera: A Molecular Approach to Link Their Distribution to Environmental Conditions
The general composition of active Rhodobacteraceae communities was found to be specific for the geographical location, exhibiting a decreasing richness with sediment depth and one-third of the Rhodobacteria-OTUs significantly responded to the prevailing redox regime, suggesting an adaption to anoxic conditions.
Investigation of next‐generation sequencing data of Klebsiella pneumoniae using web‐based tools
Applying appropriate web‐based online tools to NGS data enables the rapid extraction of comprehensive information that can be used for more efficient diagnosis and treatment of patients, while data processing is free of charge, easy and time‐efficient.
Omics Data Integration in Microbial Research for Agricultural and Environmental Applications
Omics-aided research in microbial and plant sciences helps in exploring novel scientific and technological systems to improve human health, human food and animal feed production, overall agricultural productivity, and environmental protection.
Existing Challenges and the Need for Authenticated Reference Genomes
The status of ATCC bacterial genome sequences in public databases is surveyed and the implementation of a genome sequencing workflow designed to provide reference-quality whole-genome sequences that are derived from authenticated ATCC materials is described.
Characterization of the small flavin-binding dodecin in the roseoflavin producer Streptomyces davawensis.
The results show that the dodecin of S. davawensis predominantly binds FMN and is neither involved in roseoflavin biosynthesis nor in roseoflavin resistance, indicating that dodecins broadly affect cellular physiology.


SubtiWiki–a database for the model organism Bacillus subtilis that links pathway, interaction and expression information
SubtiExpress, a third module, is created to visualize genome scale transcription data that are of unprecedented quality and density in SubtiWiki, one of the most complete collections of knowledge on a living organism in one single resource.
GeneDB—an annotation database for pathogens
GeneDB is a genome database for prokaryotic and eukaryotic pathogens and closely related organisms that combines data from completed and ongoing genome projects with curated annotation, which is readily accessible from a web-based resource.
MetaBioME: a database to explore commercially useful enzymes in metagenomic datasets
This work has developed a resource called MetaBioME, a database of commercially useful enzymes (CUEs) and a comprehensive platform to facilitate homology-based computational identification of novel homologous CUEs from metagenomic and bacterial genomic datasets, and identified several novel homologues to known CUEs that can potentially serve as leads for further experimental verification.
xBASE2: a comprehensive resource for comparative bacterial genomics
The latest version, xBASE 2.0, now provides comprehensive coverage of all bacterial genomes and features an updated modularized backend and an improved user interface, which includes a taxonomy browser and a powerful full-text search facility.
Global catalogue of microorganisms (gcm): a comprehensive database and information retrieval, analysis, and visualization system for microbial resources
A comprehensive dynamic database of microbial resources has been created, which unveils the resources preserved in culture collections, especially those whose informatics infrastructures are still under development, and which should foster cumulative research.
—an integrated protein interaction database for E. coli
A resource is presented that combines locally generated interaction and evolutionary datasets with a previously generated knowledgebase to provide an integrated view of the Escherichia coli interactome.
The MiST2 database: a comprehensive genomics resource on microbial signal transduction
The MiST2 database identifies and catalogs the repertoire of signal transduction proteins in microbial genomes and contains a host of new features and improvements including the following: draft genomes; extracytoplasmic function (ECF) sigma factor protein identification; enhanced classification of signaling proteins; novel, high-quality domain models for identifying histidine kinases and response regulators.
PATRIC, the bacterial bioinformatics database and analysis resource
The Pathosystems Resource Integration Center (PATRIC) is the all-bacterial Bioinformatics Resource Center (BRC); updates to PATRIC since its initial report in the 2007 NAR Database Issue are described.
rrnDB: improved tools for interpreting rRNA gene abundance in bacteria and archaea and a new foundation for future development
The redesign of the ribosomal RNA operon copy number database (rrnDB) brings a substantial increase in the number of genomes described, improved curation, mapping of genomes to both NCBI and RDP taxonomies, and refined tools for querying and analyzing these data.
BacillusRegNet: A transcriptional regulation database and analysis platform for Bacillus species
A system is described for the use of a model organism, Bacillus subtilis, to infer genome-wide regulatory networks in less well-studied close relatives and the putative transcription factors, their binding sequences and predicted promoter sequences along with annotations are described.