Learn More
High-throughput sequencing technologies, such as the Illumina Genome Analyzer, are powerful new tools for investigating a wide range of biological and medical questions. Statistical and computational methods are key for drawing meaningful and accurate conclusions from the massive and complex datasets generated by the sequencers. We provide a detailed(More)
Bioconductor is an open-source, open-development software project for the analysis and comprehension of high-throughput data in genomics and molecular biology. The project aims to enable interdisciplinary research, collaboration and rapid development of scientific software. Based on the statistical programming language R, Bioconductor comprises 934(More)
BACKGROUND It is well known that the time from conception to early childhood has importance for health conditions that reach into later stages of life. Recent research supports this view, and diseases such as cardiovascular morbidity, cancer, mental illnesses, asthma, and allergy may all have component causes that act early in life. Exposures in this(More)
MOTIVATION The recently released Infinium HumanMethylation450 array (the '450k' array) provides a high-throughput assay to quantify DNA methylation (DNAm) at ∼450 000 loci across a range of genomic features. Although less comprehensive than high-throughput sequencing-based techniques, this product is more cost-effective and promises to be the most widely(More)
DNA methylation is an important epigenetic modification involved in gene regulation, which can now be measured using whole-genome bisulfite sequencing. However, cost, complexity of the data, and lack of comprehensive analytical tools are major challenges that keep this technology from becoming widely applied. Here we present BSmooth, an alignment, quality(More)
The ability to measure gene expression on a genome-wide scale is one of the most promising accomplishments in molecular biology. Microarrays, the technology that first permitted this, were riddled with problems due to unwanted sources of variability. Many of these problems are now mitigated, after a decade's worth of statistical methodology development. The(More)
Generation of cDNA using random hexamer priming induces biases in the nucleotide composition at the beginning of transcriptome sequencing reads from the Illumina Genome Analyzer. The bias is independent of organism and laboratory and impacts the uniformity of the reads along the transcriptome. We provide a read count reweighting scheme, based on the(More)
As sequencing throughput approaches dozens of gigabases per day, there is a growing need for efficient software for analysis of transcriptome sequencing (RNA-Seq) data. Myrna is a cloud-computing pipeline for calculating differential gene expression in large RNA-Seq datasets. We apply Myrna to the analysis of publicly available data sets and assess the(More)
Aging and sun exposure are the leading causes of skin cancer. It has been shown that epigenetic changes, such as DNA methylation, are well established mechanisms for cancer, and also have emerging roles in aging and common disease. Here, we directly ask whether DNA methylation is altered following skin aging and/or chronic sun exposure in humans. We compare(More)
Variation in the social environment is a fundamental component of many vertebrate societies. In humans and other primates, adverse social environments often translate into lasting physiological costs. The biological mechanisms associated with these effects are therefore of great interest, both for understanding the evolutionary impacts of social behavior(More)