FRAMA: from RNA-seq data to annotated mRNA assemblies

@article{Bens2015FRAMAFR,
  title={FRAMA: from RNA-seq data to annotated mRNA assemblies},
  author={M. Bens and A. Sahm and M. Groth and N. Jahn and M. Morhart and S. Holtze and T. Hildebrandt and M. Platzer and K. Szafranski},
  journal={BMC Genomics},
  year={2015},
  volume={17}
}
Background: Advances in second-generation sequencing of RNA made a near-complete characterization of transcriptomes affordable. However, the reconstruction of full-length mRNAs via de novo RNA-seq assembly is still difficult due to the complexity of eukaryote transcriptomes with highly similar paralogs and multiple alternative splice variants. Here, we present FRAMA, a genome-independent annotation tool for de novo mRNA assemblies that addresses several post-assembly tasks, such as reduction of…
CAARS: comparative assembly and annotation of RNA-Seq data
TLDR
An automated pipeline named CAARS is built to combine novel data from RNA-Seq experiments with existing multi-species gene family alignments, and CAARS assemblies are shown to be more complete and accurate than those assembled by a standard pipeline coupled with annotation by sequence similarity on a guide species.
Compacting and correcting Trinity and Oases RNA-Seq de novo assemblies
Background: De novo transcriptome assembly of short reads is now a common step in expression analysis of organisms lacking a reference genome sequence. Several software packages are available to…
Combining protein-based transcriptome assembly, and efficient MinION long read sequencing for targeted transcript sequencing in orphan species. Validation on herbicide targets and low copy number genes in Gymnosperms, Juncaceae and Pteridophyta
TLDR
A protein-based pipeline was developed for transcript assembly between distantly related species and was tested on three evolutionarily divergent species with little sequence information available for them and for which the closest genome representatives were at least 40 million years divergent.
De novo transcriptome assembly for the spiny mouse (Acomys cahirinus)
TLDR
This study provides the first detailed characterization of the spiny mouse transcriptome, validates use of the EvidentialGene tr2aacds pipeline in mammals to augment conventional de novo assembly approaches, and provides a valuable scientific resource for further investigation into the unique physiological characteristics inherent in the genus Acomys.
De novo transcriptome assembly for the spiny mouse (Acomys cahirinus)
TLDR
This study provides the first detailed characterization of the spiny mouse transcriptome, validates the application of the EvidentialGene ‘tr2aacds’ pipeline to generate a high-quality reference transcriptome assembly in a mammalian species, and provides a valuable scientific resource for further investigation into the unique physiological characteristics inherent in the genus Acomys.
Parallel evolution of genes controlling mitonuclear balance in short‐lived annual fishes
TLDR
The pattern of positive selection in annual and nonannual African killifishes is investigated to identify a genomic substrate for the evolution of annual life history (and reduced lifespan), and some genes that are also under positive selection in long‐lived mammals such as bats and mole‐rats are identified.
Species comparison of liver proteomes reveals enhanced lipid metabolism, reduced mitochondrial respiration and enhanced expression of detoxifying enzymes in the long-lived naked mole-rat
TLDR
It is found that naked mole-rats (NMRs) express substantially higher levels of a restricted number of longevity-associated proteins that confer enhanced buffering against oxidative stress, and NMR livers display a unique expression pattern of mitochondrial proteins that results in distinct metabolic features of their mitochondria.
Analysis of the coding sequences of clownfish reveals molecular convergence in the evolution of lifespan
TLDR
These results indicate that specific pathways were recruited in independent lineages evolving an exceptionally extended or shortened lifespan and point to mito-nuclear balance as a key factor.
Species comparison of liver proteomes reveals links to naked mole-rat longevity and human aging
TLDR
It is shown at the molecular level that livers of NMRs display progressive age-dependent changes that recapitulate typical signatures of aging despite the negligible senescence and extraordinary longevity of these rodents.
Convergent evolution of genes controlling mitonuclear balance in annual fishes
TLDR
Signs of convergent evolution are observed in four out of five steps, which strongly indicates that genes under positive selection in all mitonuclear balance are preferential genetic targets for the evolution of short lifespan and annual life cycle.

References

SHOWING 1-10 OF 38 REFERENCES
Full-length transcriptome assembly from RNA-Seq data without a reference genome.
TLDR
The Trinity method for de novo assembly of full-length transcripts is presented and evaluated on samples from fission yeast, mouse and whitefly, whose reference genome is not yet available, providing a unified solution for transcriptome reconstruction in any sample.
Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels
TLDR
A software package named Oases, designed to heuristically assemble RNA-seq reads in the absence of a reference genome, across a broad spectrum of expression values and in the presence of alternative isoforms, is presented.
Systematic evaluation of spliced alignment programs for RNA-seq data
TLDR
A comparison of 26 mapping protocols based on 11 programs and pipelines found major performance differences between methods on numerous benchmarks, including alignment yield, basewise accuracy, mismatch and gap placement, exon junction discovery and suitability of alignments for transcript reconstruction.
RNA-Seq Assembly – Are We There Yet?
TLDR
The challenges and solutions to the transcriptome assembly problem are discussed, and a list of quality control parameters and the necessary scripts to produce them are provided.
Optimizing de novo transcriptome assembly from short-read RNA-Seq data: a comparative study
TLDR
This work compared the performance of publicly available transcriptome assemblers, analyzed important factors affecting de novo assembly, and proposed some practical guidelines for transcript reconstruction from short-read RNA-Seq data.
Optimizing de novo common wheat transcriptome assembly using short-read RNA-Seq data
TLDR
It is feasible to assemble the hexaploid wheat transcriptome from short reads and the assembled transcripts were comparable to Sanger-derived ESTs in terms of both continuity and accuracy.
Optimization of de novo transcriptome assembly from next-generation sequencing data.
TLDR
Two new methods for substantially improving transcriptome de novo assembly were used to successfully assemble the transcripts of the core set of genes regulating tooth development in vertebrates, while classic de novo assembly failed.
Next-generation transcriptome assembly
TLDR
This Review summarizes the recent developments in transcriptome assembly approaches (reference-based, de novo and combined strategies) along with some perspectives on transcriptome assembly in the near future.
A comparison across non-model animals suggests an optimal sequencing depth for de novo transcriptome assembly
TLDR
These assembly trends suggest that representative assemblies are generated with as few as 20 million reads for tissue samples and 30 million reads for whole-animals for RNA-level coverage, and beyond 60 million reads, the discovery of new genes is low and sequencing errors of highly-expressed genes are likely to accumulate.
Separating homeologs by phasing in the tetraploid wheat transcriptome
TLDR
This study shows that de novo transcriptome assembly of tetraploid wheat benefits from multiple k-mer assembly strategies more than that of diploid wheat, and demonstrates that phasing approaches originally designed for heterozygous diploid organisms can be used to separate the close homoeologous genomes of tetraploid wheat.