Learn More
Textpresso is a text-mining system for scientific literature. Its two major features are access to the full text of research papers and the development and use of categories of biological concepts as well as categories that describe or relate objects. A search engine enables the user to search for one or a combination of these categories and/or keywords(More)
Biofilms, or surface-attached microbial communities, are both ubiquitous and resilient in the environment. Although much is known about how biofilms form, develop, and detach, very little is understood about how these events are related to metabolism and its dynamics. It is commonly thought that large subpopulations of cells within biofilms are not actively(More)
Agriculture is being challenged to provide food, and increasingly fuel, for an expanding global population. Producing bioenergy crops on marginal lands--farmland suboptimal for food crops--could help meet energy goals while minimizing competition with food production. However, the ecological costs and benefits of growing bioenergy feedstocks--primarily(More)
Many tools exist in the analysis of bacterial RNA sequencing (RNA-seq) transcriptional profiling experiments to identify differentially expressed genes between experimental conditions. Generally, the workflow includes quality control of reads, mapping to a reference, counting transcript abundance, and statistical tests for differentially expressed genes. In(More)
We present a set of computing tools and techniques that every researcher can and should adopt. These recommendations synthesize inspiration from our own work, from the experiences of the thousands of people who have taken part in Software Carpentry and Data Carpentry workshops over the past six years, and from a variety of other guides. Unlike some other(More)
Extremely large datasets have become routine in biology. However, performing a computational analysis of a large dataset can be overwhelming, especially for novices. Here, we present a step-by-step guide to computing workflows with the biologist end-user in mind. Starting from a foundation of sound data management practices, we make specific recommendations(More)
BACKGROUND The intestinal microbiome represents a complex network of microbes that are important for human health and preventing pathogen invasion. Studies that examine differences in intestinal microbial communities across individuals with and without enteric infections are useful for identifying microbes that support or impede intestinal health. RESULTS(More)
We developed an information retrieval and extraction system that processes the full text of biological papers. The system, called Textpresso, separates text into sentences, labels words and phrases according to an ontology (an organized lexicon), and allows queries to be performed on a database of labeled sentences. The current ontology comprises(More)
  • 1