Data Set Used
We propose several means for improving the performance and training of neural networks for classification. We use crossvali-dation as a tool for optimizing network parameters and architecture. We show further that the remaining residual " generalization " error can be reduced by invoking ensembles of similar networks.
Recent studies have highlighted the surprising richness of soil bacterial communities; however, bacteria are not the only microorganisms found in soil. To our knowledge, no study has compared the diversities of the four major microbial taxa, i.e., bacteria, archaea, fungi, and viruses, from an individual soil sample. We used metagenomic and small-subunit… (More)
Viruses are the most common biological entities in the marine environment. There has not been a global survey of these viruses, and consequently, it is not known what types of viruses are in Earth's oceans or how they are distributed. Metagenomic analyses of 184 viral assemblages collected over a decade and representing 68 sites in four major oceanic… (More)
Here we present the first metagenomic analyses of an uncultured viral community from human feces, using partial shotgun sequencing. Most of the sequences were unrelated to anything previously reported. The recognizable viruses were mostly siphophages, and the community contained an estimated 1,200 viral genotypes.
Mucosal surfaces are a main entry point for pathogens and the principal sites of defense against infection. Both bacteria and phage are associated with this mucus. Here we show that phage-to-bacteria ratios were increased, relative to the adjacent environment, on all mucosal surfaces sampled, ranging from cnidarians to humans. In vitro studies of tissue… (More)
Viruses, most of which are phage, are extremely abundant in marine sediments, yet almost nothing is known about their identity or diversity. We present the metagenomic analysis of an uncultured near-shore marine-sediment viral community. Three-quarters of the sequences in the sample were not related to anything previously reported. Among the sequences that… (More)
Metagenomic sequencing of DNA viruses from the feces of a healthy week-old infant revealed a viral community with extremely low diversity. The identifiable sequences were dominated by phages, which likely influence the diversity and abundance of co-occurring microbes. The most abundant fecal viral sequences did not originate from breast milk or formula,… (More)
Viruses are the most common biological entities in the oceans by an order of magnitude. However, very little is known about their diversity. Here we report a genomic analysis of two uncultured marine viral communities. Over 65% of the sequences were not significantly similar to previously reported sequences, suggesting that much of the diversity is… (More)
BACKGROUND Phages, viruses that infect prokaryotes, are the most abundant microbes in the world. A major limitation to studying these viruses is the difficulty of cultivating the appropriate prokaryotic hosts. One way around this limitation is to directly clone and sequence shotgun libraries of uncultured viral communities (i.e., metagenomic analyses).… (More)
The species composition and metabolic potential of microbial and viral communities are predictable and stable for most ecosystems. This apparent stability contradicts theoretical models as well as the viral-microbial dynamics observed in simple ecosystems, both of which show Kill-the-Winner behavior causing cycling of the dominant taxa. Microbial and viral… (More)