Minimum information about a marker gene sequence (MIMARKS) and minimum information about any (x) sequence (MIxS) specifications

@article{Yilmaz2011MinimumIA,
  title={Minimum information about a marker gene sequence (MIMARKS) and minimum information about any (x) sequence (MIxS) specifications},
  author={Pelin Yilmaz and Renzo Kottmann and Dawn Field and Rob Knight and James R. Cole and Linda A. Amaral-Zettler and Jack A. Gilbert and Ilene Karsch-Mizrachi and Anjanette Johnston and Guy Cochrane and Robert Vaughan and Chris I. Hunter and Joonhong Park and Norman Morrison and Philippe Rocca-Serra and Peter Sterk and Manimozhiyan Arumugam and Mark J. Bailey and Laura K. Baumgartner and Bruce W. Birren and Martin J. Blaser and Vivien Bonazzi and Tim Booth and Peer Bork and Frederic D. Bushman and Pier Luigi Buttigieg and Patrick S. G. Chain and Emily S. Charlson and Elizabeth K. Costello and Heather Huot-Creasy and Peter Dawyndt and Todd Z. DeSantis and Noah Fierer and Jed A. Fuhrman and Rachel E. Gallery and Dirk Gevers and Richard A. Gibbs and Inigo San Gil and Antonio Ruvalcaba González and Jeffrey I. Gordon and Robert P. Guralnick and Wolfgang Hankeln and Sarah K. Highlander and P. Bernt Hugenholtz and Janet K. Jansson and Andrew L. Kau and Scott T. Kelley and Jerry Kennedy and Dan Knights and Omry Koren and Justin Kuczynski and Nikos C. Kyrpides and Robert Larsen and Christian L. Lauber and Teresa Marie Legg and Ruth E. Ley and Catherine A. Lozupone and Wolfgang Ludwig and Donna Lyons and Eamonn James Maguire and Barbara A. Meth{\'e} and Folker Meyer and Brian D. Muegge and Sara Nakielny and Karen E. Nelson and Diana R. Nemergut and Josh D. Neufeld and Lindsay K. Newbold and Anna E. Oliver and Norman R. Pace and Giriprakash Palanisamy and J{\"o}rg Peplies and Joseph F Petrosino and Lita M. Proctor and Elmar Pruesse and Christian Quast and Jeroen Raes and Sujeevan Ratnasingham and Jacques Ravel and David A. Relman and Susanna Assunta-Sansone and Patrick D. Schloss and Lynn M. Schriml and Rohini Sinha and Michelle I. Smith and Erica Sodergren and Aym{\'e} Spor and Jesse Stombaugh and James M. Tiedje and Doyle V. Ward and George M. Weinstock and Doug Wendel and Owen White and Andrew S Whiteley and Andreas Wilke and Jennifer R. 
Wortman and Tanya Yatsunenko and Frank Oliver Gl{\"o}ckner},
  journal={Nature Biotechnology},
  year={2011},
  volume={29},
  pages={415--420}
}
Here we present a standard developed by the Genomic Standards Consortium (GSC) for reporting marker gene sequences—the minimum information about a marker gene sequence (MIMARKS). We also introduce a system for describing the environment from which a biological sample originates. The 'environmental packages' apply to any genome sequence of known origin and can be used in combination with MIMARKS and other GSC checklists. Finally, to establish a unified standard for describing sequence data and… 
Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea
TLDR
Two standards developed by the Genomic Standards Consortium (GSC) for reporting bacterial and archaeal genome sequences are presented: the Minimum Information about a Single Amplified Genome (MISAG) and the Minimum Information about a Metagenome-Assembled Genome (MIMAG), which include estimates of genome completeness and contamination.
Minimum Information about an Uncultivated Virus Genome (MIUViG)
TLDR
Community-wide adoption of MIUViG standards, which complement the Minimum Information about a Single Amplified Genome and Metagenome-Assembled Genome standards, will improve the reporting of uncultivated virus genomes in public databases, and should enable more robust comparative studies and a systematic exploration of the global virosphere.
The Global Genome Biodiversity Network (GGBN) Data Standard specification
TLDR
The new data standard presented here builds upon existing standards commonly used within the community, extending them with the capability to exchange data on tissue, environmental and DNA samples as well as sequences.
SeqDB: Biological Collection Management with Integrated DNA Sequence Tracking
TLDR
The overall system, the close collaboration with scientists and technicians undertaking molecular research involving the national collection, and the centralization of their data sets in an access controlled relational database implementing internationally recognized standards are described.
FAIR enough? A perspective on the status of nucleotide sequence data and metadata on public archives
TLDR
The status of metadata available for raw read Illumina amplicon and whole genome shotgun sequencing data derived from ecological metagenomic material that are accessible at the European Nucleotide Archive (ENA), as well as the compliance of the primary sequence data (fastq files) with data submission requirements are reviewed.
Using MIxS: An Implementation Report from Two Metagenomic Information Systems
TLDR
This work presents the experience of implementing MIxS in two metagenomic information systems, the Earth Microbiome Project (EMP) and the Government of Canada (GoC) Ecobiomics project, and describes issues surrounding ENVO (the Environment Ontology), which is required for filling out the mandatory MIxS fields "Environmental material", "Biome", and "Environmental feature".
DDBJ new system and service refactoring
TLDR
The DNA Data Bank of Japan (DDBJ) maintains a primary nucleotide sequence database, provides analytical resources for biological information to researchers, and has achieved significant improvements in system performance.
Building a global genomics observatory: Using GEOME (the Genomic Observatories Metadatabase) to expedite and improve deposition and retrieval of genetic data and metadata for biodiversity research.
TLDR
How GEOME can enable genuinely open data workflows for researchers in the field of molecular ecology is described.
MetaCOXI: an integrated collection of metazoan mitochondrial cytochrome oxidase subunit-I DNA sequences
TLDR
MetaCOXI is an integrated collection of curated metazoan COXI DNA sequences with their associated harmonized taxonomy and metadata, built on the two most extensive available data resources, namely the European Nucleotide Archive and the Barcode of Life Data System.
Standardized Metadata for Human Pathogen/Vector Genomic Sequences
TLDR
The use of this metadata standard by all ongoing and future GSCID sequencing projects will provide a consistent representation of these data in the BRC resources and other repositories that leverage these data, allowing investigators to identify relevant genomic sequences and perform comparative genomics analyses that are both statistically meaningful and biologically relevant.

References

The minimum information about a genome sequence (MIGS) specification
TLDR
Here, the minimum information about a genome sequence (MIGS) specification is introduced with the intent of promoting participation in its development and discussing the resources that will be required to develop improved mechanisms of metadata capture and exchange.
A standard MIGS/MIMS compliant XML Schema: toward the development of the Genomic Contextual Data Markup Language (GCDML).
TLDR
The need for a sample-centric, strongly-typed schema, which will provide a diverse set of descriptors for describing the exact origin and processing of a biological sample, from sampling to sequencing, and subsequent analysis is described.
Data Standards for BARCODE Records in INSDC (BRIs)
TLDR
A proposal for new data standards that would apply to DNA barcode records submitted to INSDC members in the future was presented at the INSDC annual meeting where it was greeted with strong support and swift approval.
GenBank
TLDR
GenBank® is a comprehensive database that contains publicly available nucleotide sequences for over 340 000 formally described species and integrates these records with a variety of other data including taxonomy nodes, genomes, protein structures, and biomedical journal literature in PubMed.
SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB
TLDR
SILVA (from Latin silva, forest) was implemented to provide a central comprehensive web resource for up-to-date, quality-controlled databases of aligned rRNA sequences from the Bacteria, Archaea and Eukarya domains.
Bacterial phylogeny based on comparative sequence analysis (review)
TLDR
It is shown that only careful data analysis starting with a proper alignment, followed by the analysis of positional variability, rates and character of change, testing various data selections, applying alternative treeing methods and, finally, performing confidence tests, allows reasonable utilization of the limited phylogenetic information.
The Ribosomal Database Project: improved alignments and new tools for rRNA analysis
TLDR
An improved alignment strategy uses the Infernal secondary-structure-aware aligner to provide more consistent, higher-quality alignments and faster processing of user sequences, and a new Pyrosequencing Pipeline provides tools to support analysis of ultra-high-throughput rRNA sequencing data.
The uncultured microbial majority.
TLDR
Genome sequence information that would allow ribosomal RNA gene trees to be related to broader patterns in microbial genome evolution is scant, and therefore microbial diversity remains largely unexplored territory.
Study of Genetic Diversity of Eukaryotic Picoplankton in Different Oceanic Regions by Small-Subunit rRNA Gene Cloning and Sequencing
TLDR
The diversity of marine picoeukaryotes was determined by sequencing cloned 18S rRNA genes in five genetic libraries from North Atlantic, Southern Ocean, and Mediterranean Sea surface waters. The phylogenetic diversity in each library was high: each included many different OTUs and members of very distantly related phylogenetic groups.
Oceanic 18S rDNA sequences from picoplankton reveal unsuspected eukaryotic diversity
TLDR
35 full sequences of the small-subunit (18S) ribosomal RNA gene derived from a picoplanktonic assemblage collected at a depth of 75 m in the equatorial Pacific Ocean are analysed and show that there is a high diversity of picoeukaryotes.