Errors in scientific results due to software bugs are not limited to a few high-profile cases that lead to retractions and are widely reported. Here I estimate that in fact most scientific results are probably wrong if data have passed through a computer, and that these errors may remain largely undetected. The opportunities for both subtle and profound…
Lampreys are extant representatives of the jawless vertebrate lineage that diverged from jawed vertebrates around 500 million years ago. Lamprey genomes contain information crucial for understanding the evolution of gene families in vertebrates. The ATP-binding cassette (ABC) gene family is found from prokaryotes to eukaryotes. The recent availability of…
We review currently available technologies for deconvoluting metagenomic data into individual genomes that represent populations, strains, or genotypes present in the community. An evaluation of chromosome conformation capture (3C) and related techniques in the context of metagenomics is presented, using mock microbial communities as a reference. We provide…
We present an open implementation of the HyperLogLog cardinality estimation sketch for counting fixed-length substrings of DNA strings ("k-mers"). The HyperLogLog sketch implementation is in C++ with a Python interface, and is distributed as part of the khmer software package. khmer is freely available from https://github.com/dib-lab/khmer under a BSD…
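To illustrate the idea of estimating the number of distinct k-mers with a HyperLogLog sketch, here is a minimal pure-Python sketch. It is not khmer's implementation or API: the class name `HyperLogLog`, the helper `kmers`, the SHA-1-based 64-bit hash, and the default precision `p=12` are all assumptions chosen for illustration. It also treats each k-mer as a literal string and does not collapse reverse complements.

```python
import hashlib
import math


def kmers(seq, k):
    """Yield every length-k substring of seq (hypothetical helper)."""
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


class HyperLogLog:
    """Illustrative HyperLogLog sketch with 2**p registers (assumes p >= 7)."""

    def __init__(self, p=12):
        self.p = p
        self.m = 1 << p
        self.registers = [0] * self.m
        # Standard bias-correction constant, valid for m >= 128.
        self.alpha = 0.7213 / (1 + 1.079 / self.m)

    def add(self, item):
        # Assumption: take the first 64 bits of SHA-1 as the hash value.
        h = int.from_bytes(hashlib.sha1(item.encode()).digest()[:8], "big")
        idx = h >> (64 - self.p)                 # top p bits pick a register
        rest = h & ((1 << (64 - self.p)) - 1)    # remaining 64 - p bits
        # Rank = position of the leftmost 1-bit in the remaining bits.
        rank = (64 - self.p) - rest.bit_length() + 1
        self.registers[idx] = max(self.registers[idx], rank)

    def estimate(self):
        raw = self.alpha * self.m ** 2 / sum(2.0 ** -r for r in self.registers)
        zeros = self.registers.count(0)
        # Small-range correction: fall back to linear counting.
        if raw <= 2.5 * self.m and zeros:
            return self.m * math.log(self.m / zeros)
        return raw


def estimate_distinct_kmers(seq, k, p=12):
    """Estimate the number of distinct k-mers in seq."""
    hll = HyperLogLog(p)
    for kmer in kmers(seq, k):
        hll.add(kmer)
    return hll.estimate()
```

With `p=12` the sketch keeps 4096 small registers regardless of input size, and the expected relative error is roughly 1.04 / sqrt(4096), or about 1.6%. Raising `p` trades memory for accuracy.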
