Shouji: a fast and efficient pre-alignment filter for sequence alignment
@article{Alser2018ShoujiAF, title={Shouji: a fast and efficient pre-alignment filter for sequence alignment}, author={Mohammed H. Alser and Hasan Hassan and Akash Kumar and Onur Mutlu and Can Alkan}, journal={Bioinformatics}, year={2018} }
MOTIVATION
The ability to generate massive amounts of sequencing data continues to overwhelm the processing capability of existing algorithms and compute infrastructures. In this work, we explore the use of hardware/software co-design and hardware acceleration to significantly reduce the execution time of short sequence alignment, a crucial step in analyzing sequenced genomes. We introduce Shouji, a highly-parallel and accurate pre-alignment filter that remarkably reduces the need for…
59 Citations
Accelerating the Understanding of Life's Code Through Better Algorithms and Hardware Design
- 2019
Computer Science
ArXiv
This thesis introduces four new algorithms that function as a pre-alignment step and aim to filter out most incorrect candidate locations, and develops an efficient CPU implementation of the SneakySnake algorithm for commodity desktops and servers.
SneakySnake: A Fast and Accurate Universal Genome Pre-Alignment Filter for CPUs, GPUs, and FPGAs
- 2021
Computer Science
Bioinform.
SneakySnake is introduced, a highly parallel and highly accurate pre-alignment filter that remarkably reduces the need for computationally costly sequence alignment and is efficient to implement on CPUs, GPUs, and FPGAs.
WFA-GPU: Gap-affine pairwise alignment using GPUs
- 2023
Computer Science
bioRxiv
WFA-GPU is presented, a GPU (Graphics Processing Unit)-accelerated tool to compute exact gap-affine alignments based on the WFA algorithm, along with the algorithmic adaptations and performance optimizations that exploit the massively parallel capabilities of modern GPU devices to accelerate the alignment computations.
Scrooge: a fast and memory-frugal genomic sequence aligner for CPUs, GPUs, and ASICs
- 2023
Computer Science
Bioinform.
This work identifies and addresses three inefficiencies in the GenASM algorithm: it moves a large amount of data, has a large memory footprint, and does some unnecessary work. It proposes Scrooge, a fast and memory-frugal genomic sequence aligner.
GateKeeper: a new hardware architecture for accelerating pre‐alignment in DNA short read mapping
- 2017
Computer Science
Bioinform.
GateKeeper is the first design to accelerate pre‐alignment using Field‐Programmable Gate Arrays (FPGAs), which can perform pre‐alignment much faster than software, and maintains high accuracy while providing 90‐fold and 130‐fold speedup over the state‐of‐the‐art software pre‐alignment techniques, Adjacency Filter and Shifted Hamming Distance (SHD), respectively.
Multiprocess Implementation of DNA Pre-alignment Filtering using the Bit Matrix Algorithm
- 2020
Computer Science
2020 IEEE 12th International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment, and Management (HNICEM)
This manuscript focuses on the bit matrix pre-alignment filter with a goal of improving the implementation by using multiprocessing techniques that exploit multi-core CPUs.
DiagAF: A More Accurate and Efficient Pre-Alignment Filter for Sequence Alignment
- 2022
Computer Science
IEEE/ACM Transactions on Computational Biology and Bioinformatics
DiagAF is a more accurate and efficient pre-alignment algorithm for sequence alignment that uses a new lower bound of edit distance based on shift Hamming masks and can handle sequence pairs of unequal length, whereas state-of-the-art methods handle only pairs of equal length.
Proposal of Smith-Waterman algorithm on FPGA to accelerate the forward and backtracking steps
- 2022
Computer Science
PloS one
This work proposes a parallel hardware design for the SW algorithm with a systolic array structure to accelerate the forward and backtracking steps and calculates and stores the paths in the forward stage for pre-organizing the alignment, which reduces the complexity of the backtracking stage.
Accelerating Genome Sequence Analysis via Efficient Hardware/Algorithm Co-Design
- 2021
Computer Science, Biology
ArXiv
Overall, it is demonstrated that genome sequence analysis can be accelerated by co-designing scalable and energy-efficient customized accelerators along with efficient algorithms for the key steps of genome sequence analysis.
FPGA Acceleration of Pre-Alignment Filters for Short Read Mapping With HLS
- 2022
Computer Science
IEEE Access
This work implements and optimizes several state-of-the-art pre-alignment filters using C/C++-based HLS to expand their portability to a wide range of systems supporting the OpenCL runtime, and performs a complete analysis of the performance and accuracy of the filters.
58 References
ASAP: Accelerated Short-Read Alignment on Programmable Hardware
- 2019
Computer Science
IEEE Transactions on Computers
This paper explores the use of hardware acceleration to significantly improve the runtime of short-read alignment, a crucial step in preprocessing sequenced genomes, by focusing on the Levenshtein distance (edit-distance) computation kernel and proposing the ASAP accelerator.
Accelerating the Next Generation Long Read Mapping with the FPGA-Based System
- 2014
Computer Science
IEEE/ACM Transactions on Computational Biology and Bioinformatics
A novel FPGA-based system improves the efficiency of long read mapping and reduces the alignment cycles from weeks to days, compared to the state-of-the-art long read aligner BWA-SW.
GateKeeper: a new hardware architecture for accelerating pre‐alignment in DNA short read mapping
- 2017
Computer Science
Bioinform.
GateKeeper is the first design to accelerate pre‐alignment using Field‐Programmable Gate Arrays (FPGAs), which can perform pre‐alignment much faster than software, and maintains high accuracy while providing 90‐fold and 130‐fold speedup over the state‐of‐the‐art software pre‐alignment techniques, Adjacency Filter and Shifted Hamming Distance (SHD), respectively.
merAligner: A Fully Parallel Sequence Aligner
- 2015
Computer Science
2015 IEEE International Parallel and Distributed Processing Symposium
This work presents merAligner, a highly parallel sequence aligner that implements a seed-and-extend algorithm, employs parallelism in all of its components, efficiently parallelizes the I/O-intensive phases, and implements an effective load balancing scheme.
MAGNET: Understanding and Improving the Accuracy of Genome Pre-Alignment Filtering
- 2017
Computer Science
MAGNET is proposed, a new filtering strategy that maintains high accuracy across different edit distance thresholds and data sets and significantly improves the accuracy of pre-alignment filtering by one to two orders of magnitude.
A Survey of Software and Hardware Approaches to Performing Read Alignment in Next Generation Sequencing
- 2017
Computer Science, Biology
IEEE/ACM Transactions on Computational Biology and Bioinformatics
A brief biological introduction to genomics and NGS is provided and the most popular next generation read alignment tools and algorithms are discussed, along with a comprehensive survey of the hardware implementations used to accelerate these algorithms.
FPGASW: Accelerating Large-Scale Smith–Waterman Sequence Alignment Application with Backtracking on FPGA Linear Systolic Array
- 2017
Computer Science
Interdisciplinary Sciences: Computational Life Sciences
This paper proposes fine-grained parallelized SW algorithms using an affine gap penalty, implements parallel computing structures to accelerate SW with backtracking on an FPGA platform, and analyzes the dynamic parallel computing features of anti-diagonal elements and the storage expansion problem resulting from the backtracking stage.
GSWABE: faster GPU‐accelerated sequence alignment with optimal alignment retrieval for short DNA sequences
- 2015
Computer Science
Concurr. Comput. Pract. Exp.
This work investigates a general tile‐based approach to facilitating fast alignment by deeply exploring the powerful compute capability of CUDA‐enabled GPUs and presents GSWABE, a graphics processing unit (GPU)‐accelerated pairwise sequence alignment algorithm for a collection of short DNA sequences.
Hardware-Acceleration of Short-Read Alignment Based on the Burrows-Wheeler Transform
- 2016
Computer Science
IEEE Transactions on Parallel and Distributed Systems
An FPGA-based hardware accelerator that can align a few hundred million short DNA fragments in an hour by using 80 processing elements in parallel is proposed and has the same mapping quality compared to the software-based methods.
Reconfigurable acceleration of genetic sequence alignment: A survey of two decades of efforts
- 2017
Computer Science
2017 27th International Conference on Field Programmable Logic and Applications (FPL)
A thorough survey of the proposed accelerators is provided, with a qualitative categorization based on their algorithms and speedup, to guide selection for biologists and to provide insight into future research directions for FPGA scientists.