Cydney B. Nielsen

Learn More
Neurospora crassa is a central organism in the history of twentieth-century genetics, biochemistry and molecular biology. Here, we report a high-quality draft sequence of the N. crassa genome. The approximately 40-megabase genome encodes about 10,000 protein-coding genes--more than twice as many as in the fission yeast Schizosaccharomyces pombe and only(More)
MOTIVATION Whole transcriptome shotgun sequencing data from non-normalized samples offer unique opportunities to study the metabolic states of organisms. One can deduce gene expression levels using sequence coverage as a surrogate, identify coding changes or discover novel isoforms or transcripts. Especially for discovery of novel events, de novo assembly(More)
Although it is known that the methylation of DNA in 5' promoters suppresses gene expression, the role of DNA methylation in gene bodies is unclear. In mammals, tissue- and cell type-specific methylation is present in a small percentage of 5' CpG island (CGI) promoters, whereas a far greater proportion occurs across gene bodies, coinciding with highly(More)
Core RNA-processing reactions in eukaryotic cells occur cotranscriptionally in a chromatin context, but the relationship between chromatin structure and pre-mRNA processing is poorly understood. We observed strong nucleosome depletion around human polyadenylation sites (PAS) and nucleosome enrichment just downstream of PAS. In genes with multiple(More)
Vertebrate mRNAs are frequently targeted for post-transcriptional repression by microRNAs (miRNAs) through mechanisms involving pairing of 3' UTR seed matches to bases at the 5' end of miRNAs. Through analysis of expression array data following miRNA or siRNA overexpression or inhibition, we found that mRNA fold change increases multiplicatively (i.e.,(More)
MicroRNAs (miRNAs) are 19–22-nucleotide noncoding RNAs that post-transcriptionally regulate mRNA targets. We have identified endogenous miRNA binding sites in mouse embryonic stem cells (mESCs), by performing photo-cross-linking immunoprecipitation using antibodies to Argonaute (Ago2) followed by deep sequencing of RNAs (CLIP-seq). We also performed(More)
Little is known about the patterns of intron gain and loss or the relative contributions of these two processes to gene evolution. To investigate the dynamics of intron evolution, we analyzed orthologous genes from four filamentous fungal genomes and determined the pattern of intron conservation. We developed a probabilistic model to estimate the most(More)
IMPORTANCE E-cadherin (CDH1) is a cancer predisposition gene mutated in families meeting clinically defined hereditary diffuse gastric cancer (HDGC). Reliable estimates of cancer risk and spectrum in germline mutation carriers are essential for management. For families without CDH1 mutations, genetic-based risk stratification has not been possible,(More)
To the Editor: Advances in next-generation sequencing have reshaped the landscape of genomic and epigenomic research. Large consortia such as the Encyclopedia of DNA Elements, the Roadmap Epigenomics Mapping Consortium and The Cancer Genome Atlas have generated tens of thousands of sequencingbased genome-wide datasets, creating a reference and resource for(More)
As our ability to generate sequencing data continues to increase, data analysis is replacing data generation as the rate-limiting step in genomics studies. Here we provide a guide to genomic data visualization tools that facilitate analysis tasks by enabling researchers to explore, interpret and manipulate their data, and in some cases perform on-the-fly(More)