A complete domain-to-species taxonomy for Bacteria and Archaea

@article{Parks2020ACD,
  title={A complete domain-to-species taxonomy for Bacteria and Archaea},
  author={Donovan H. Parks and Maria Chuvochina and Pierre-Alain Chaumeil and Christian Rinke and Aaron J. Mussig and P. Bernt Hugenholtz},
  journal={Nature Biotechnology},
  year={2020},
  pages={1 - 8}
}
The Genome Taxonomy Database is a phylogenetically consistent, genome-based taxonomy that provides rank-normalized classifications for ~150,000 bacterial and archaeal genomes from domain to genus. However, almost 40% of the genomes in the Genome Taxonomy Database lack a species name. We address this limitation by using commonly accepted average nucleotide identity criteria to set bounds on species and propose species clusters that encompass all publicly available bacterial and archaeal genomes… Expand

Figures from this paper

A standardized archaeal taxonomy for the Genome Taxonomy Database.
TLDR
A standardized archaeal taxonomy is proposed that is derived from a 122-concatenated-protein phylogeny that resolves polyphyletic groups and normalizes ranks based on relative evolutionary divergence and is shown to robustly correct for substitution rates varying up to 30-fold using simulated datasets. Expand
GTDB: an ongoing census of bacterial and archaeal diversity through a phylogenetically consistent, rank normalized and complete genome-based taxonomy.
TLDR
This work discusses improvements to the GTDB website which allow tracking of taxonomic changes, easy assessment of genome assembly quality, and identification of genomes assembled from type material or used as species representatives and discusses the use of average nucleotide identities as a pragmatic approach for delineating prokaryotic species. Expand
Complete genomes derived by directly sequencing freshwater bloom populations emphasize the significance of the genus level ADA clade within the Nostocales.
TLDR
Data is presented indicating that the ADA clade is a highly significant component of current cyanobacterial HABs, including members assigned to the prevalent Dolichospermum and Aphanizomenon genera, as well as Cuspidothrix and Anabaena. Expand
Diversity, function and evolution of marine microbe genomes
TLDR
The database provides a comprehensive resource for marine microbiome, which would be a valuable reference for studies of marine life origination and evolution, ecology monitor and protection, bioactive compound development. Expand
Roadmap for naming uncultivated Archaea and Bacteria
TLDR
The authors discuss the issue of naming uncultivated prokaryotic microorganisms, which currently do not have a formal nomenclature system due to a lack of type material or cultured representatives, and propose two recommendations including the recognition of DNA sequences as type material. Expand
Microbial Phylogenetic Context Using Phylogenetic Outlines
TLDR
A new interactive graphical tool is provided that addresses the phylogenetic context of a draft genome using Mash sketches to compare against all bacterial and archaeal representative genomes in the Genome Taxonomy Database taxonomy, all within the framework of SplitsTree5. Expand
Amplicon Sequence Variants Artificially Split Bacterial Genomes into Separate Clusters
TLDR
This research highlights the risk of splitting a single bacterial genome into separate clusters when ASVs are used to analyze 16S rRNA gene sequence data. Expand
Prokaryotic taxonomy and nomenclature in the age of big sequence data
TLDR
The challenge ahead is to reach a consensus on the taxonomic framework and to adapt and scale the existing nomenclatural code, or create a new code, to systematically incorporate uncultured taxa into the chosen framework. Expand
Defining the Rhizobium leguminosarum Species Complex
TLDR
It is concluded that access to a large number of genome sequences is a powerful tool for characterizing the diversity of bacteria, and that taxonomic conclusions should be based on all available genome sequences, not just those of type strains. Expand
The Hidden Genomic Diversity, Specialized Metabolite Capacity, and Revised Taxonomy of Burkholderia Sensu Lato
TLDR
The 95% ANI threshold proved capable of delineating most genomic species and was only increased to resolve several closely related species, enabling the assessment of species classifications of over 4000 genomes, and the correction of over 400 genome taxonomic assignments in public databases into existing and uncharacterized genomic species groups. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 68 REFERENCES
A Genus Definition for Bacteria and Archaea Based on a Standard Genome Relatedness Index
TLDR
Genetic coherence is an emergent property of genera in Bacteria and Archaea that relies on the combined use of the average nucleotide identity, genome alignment fraction, and the distinction between type- and non-type species in this study. Expand
A genus definition for Bacteria and Archaea based on genome relatedness and taxonomic affiliation
TLDR
Results show that a distinct difference between distant relatives and close relatives at the genome level (i.e., genomic coherence) is an emergent property of genera in Bacteria and Archaea. Expand
A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life
TLDR
This work used a concatenated protein phylogeny as the basis for a bacterial taxonomy that conservatively removes polyphyletic groups and normalizes taxonomic ranks on the basis of relative evolutionary divergence. Expand
Towards a Genome-Based Taxonomy for Prokaryotes
TLDR
The AAI-based approach provides a means to evaluate the robustness of alternative genetic markers for phylogenetic purposes, and could contribute significantly to a genome-based taxonomy for all microbial organisms. Expand
Microbial species delineation using whole genome sequences
TLDR
This work demonstrates that the combination of gANI and the alignment fraction between two genomes accurately reflects their genomic relatedness, and proposes this precise and objective AF,gANI-based species definition: the MiSI (Microbial Species Identifier) method, to be used to address previous inconsistencies in species classification. Expand
An improved Greengenes taxonomy with explicit ranks for ecological and evolutionary analyses of bacteria and archaea
TLDR
A ‘taxonomy to tree’ approach for transferring group names from an existing taxonomy to a tree topology is developed and used to apply the Greengenes, National Center for Biotechnology Information (NCBI) and cyanoDB (Cyanobacteria only) taxonomies to a de novo tree comprising 408 315 sequences. Expand
Genomic insights that advance the species definition for prokaryotes.
TLDR
The average nucleotide identity of the shared genes between two strains was found to be a robust means to compare genetic relatedness among strains, and that ANI values of approximately 94% corresponded to the traditional 70% DNA-DNA reassociation standard of the current species definition. Expand
Consistent metagenome-derived metrics verify and define bacterial species boundaries
TLDR
The results verify the existence of sequence-discrete microbial species in metagenome-derived genomes and highlight the usefulness of ribosomal genes for gene-level species discrimination. Expand
High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries
TLDR
FastANI is developed, a method to compute ANI using alignment-free approximate sequence mapping, and it is shown 95% ANI is an accurate threshold for demarcating prokaryotic species by analyzing about 90,000 proKaryotic genomes. Expand
1,003 reference genomes of bacterial and archaeal isolates expand coverage of the tree of life
We present 1,003 reference genomes that were sequenced as part of the Genomic Encyclopedia of Bacteria and Archaea (GEBA) initiative, selected to maximize sequence coverage of phylogenetic space.Expand
...
1
2
3
4
5
...