Halvade-RNA: Parallel variant calling from transcriptomic data using MapReduce

  title={Halvade-RNA: Parallel variant calling from transcriptomic data using MapReduce},
  author={Dries Decap and J. Reumers and Charlotte Herzeel and Pascal Costanza and J. Fostier},
  journal={PLoS ONE},
  • Dries Decap, J. Reumers, +2 authors J. Fostier
  • Published 2017
  • Computer Science, Medicine
  • PLoS ONE
  • Given the current cost-effectiveness of next-generation sequencing, the amount of DNA-seq and RNA-seq data generated is ever increasing. One of the primary objectives of NGS experiments is calling genetic variants. While highly accurate, most variant calling pipelines are not optimized to run efficiently on large data sets. However, as variant calling in genomic data has become common practice, several methods have been proposed to reduce runtime for DNA-seq analysis through the use of parallel… CONTINUE READING
    14 Citations

    Figures, Tables, and Topics from this paper

    SparkRA: Enabling Big Data Scalability for the GATK RNA-seq Pipeline with Apache Spark
    • 1
    • Highly Influenced
    • PDF
    Cloud accelerated alignment and assembly of full-length single-cell RNA-seq data using Falco
    • 1
    • PDF
    Cloud based computing technologies for genomic medicine
    • PDF
    A Fast and Scalable Workflow for SNPs Detection in Genome Sequences Using Hadoop Map-Reduce
    • Highly Influenced
    • PDF
    Processing next generation sequencing data in map-reduce framework using hadoop-BAM in a computer cluster
    • 4


    Halvade: scalable sequence analysis with MapReduce
    • 63
    • PDF
    Systematic evaluation of spliced alignment programs for RNA-seq data
    • 441
    • PDF
    Reliable identification of genomic variants from RNA-seq data.
    • 258
    STAR: ultrafast universal RNA-seq aligner
    • 14,544
    • PDF
    CloudBurst: highly sensitive read mapping with MapReduce
    • M. Schatz
    • Computer Science, Medicine
    • Bioinform.
    • 2009
    • 650
    • PDF
    Supercomputing for the parallelization of whole genome analysis
    • 48
    • PDF