CUDA-BLASTP: Accelerating BLASTP on CUDA-Enabled Graphics Hardware

@article{Liu2011CUDABLASTPAB,
  title={CUDA-BLASTP: Accelerating BLASTP on CUDA-Enabled Graphics Hardware},
  author={Weiguo Liu and Bertil Schmidt and Wolfgang M{\"u}ller-Wittig},
  journal={IEEE/ACM Transactions on Computational Biology and Bioinformatics},
  year={2011},
  volume={8},
  pages={1678-1684}
}
Scanning protein sequence database is an often repeated task in computational biology and bioinformatics. However, scanning large protein databases, such as GenBank, with popular tools such as BLASTP requires long runtimes on sequential architectures. Due to the continuing rapid growth of sequence databases, there is a high demand to accelerate this task. In this paper, we demonstrate how GPUs, powered by the Compute Unified Device Architecture (CUDA), can be used as an efficient computational… Expand
Mapping of BLASTP Algorithm onto GPU Clusters
TLDR
This paper demonstrates how GPU clusters, powered by the Compute Unified Device Architecture (CUDA), OpenMP, and MPI parallel programming models can be used as an efficient computational platform to accelerate the popular BLASTP algorithm. Expand
CLUS_GPU-BLASTP: accelerated protein sequence alignment using GPU-enabled cluster
TLDR
An accelerated implementation of protein BLAST, i.e., CLUS_GPU-BLASTP for multiple query sequence processing in parallel, on graphical processing unit (GPU)-enabled high-performance cluster is proposed and achieves a speedup ranging from 7.4X to 8.2X. Expand
Accelerating Protein Sequence Search in a Heterogeneous Computing System
TLDR
An implementation of the BLAST algorithm for searching protein sequences in a heterogeneous computing system that delivers a seven-fold speedup over the sequential BLASTP for the most computationally intensive phase on a NVIDIA Fermi C2050 GPU. Expand
Accelerating iterative protein sequence alignment on a heterogeneous GPU-CPU platform
TLDR
A survey on various GPU accelerated BLAST solutions is presented, comparing them with respect to the algorithm stages implemented on GPU, the interaction between GPU and CPU execution paths, the parallelization granularity applied and other effective aspects, and a GPU-CPU multithreaded solution accelerating PSI-BLAST is proposed. Expand
H‐BLAST: a fast protein sequence alignment toolkit on heterogeneous computers with GPUs
TLDR
The heterogeneous BLAST (H‐BLAST), a fast parallel search tool for a heterogeneous computer that couples CPUs and GPUs, is developed to accelerate BLASTX and BLASTP—basic tools of NCBI‐BLast. Expand
cuBLASTP: Fine-Grained Parallelization of Protein Sequence Search on a GPU
TLDR
The proposed cuBLASTP, an efficient fine-grained BLASTP implementation for the GPU using CUDA, encompasses many research contributions, including memory-access reordering to reorder hits from column-major order to diagonal- major order, position-based indexing to map a hit with a packed data structure to a bin, and aggressive hit filtering to eliminate hits beyond the threshold distance along the diagonal. Expand
Parallel Implementation of MAFFT on CUDA-Enabled Graphics Hardware
TLDR
To fully exploit the GPU's capabilities for accelerating MAFFT, the sequence data organization is optimized, the bandwidth bottleneck of memory access is eliminated, a memory allocation and reuse strategy is designed, and a new modified-run-length encoding scheme is proposed to reduce memory consumption. Expand
BLVector: Fast BLAST-Like Algorithm for Manycore CPU With Vectorization
TLDR
BLVector reduces the execution time of BLAST+ when aligning up to mid-size protein sequences (∼750 amino acids), and can be considered complementary bioinformatics tools. Expand
Comparative analysis of protein alignment algorithms in parallel environment using CUDA
TLDR
Experimental results show that parallel implementation of BLAST algorithm mostly range 2-5 times faster than parallel Smith-Waterman, and this paper analyzes both the algorithms in a parallel environment with the help of NVIDIA GPU. Expand
CUDA ClustalW: An efficient parallel algorithm for progressive multiple sequence alignment on Multi-GPUs
TLDR
A GPU version of ClustalW v2.0.0 is proposed, which can achieve more than 33× speedups for overall execution time by comparing to ClustAlW v1.11.0, in this work. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 31 REFERENCES
CUDASW++: optimizing Smith-Waterman sequence database searches for CUDA-enabled graphics processing units
TLDR
The CUDASW++ implementation provides a significant performance improvement for Smith-Waterman-based protein sequence database searches by fully exploiting the compute capability of commonly used CUDA-enabled low-cost GPUs. Expand
Mercury BLASTP: Accelerating Protein Sequence Alignment
TLDR
The architecture of the portions of the application that are accelerated in the FPGA are described, and the integration of these FPGa-accelerated portions with the existing BLASTP software is described. Expand
CUDA compatible GPU cards as efficient hardware accelerators for Smith-Waterman sequence alignment
TLDR
The results show that graphic cards are now sufficiently advanced to be used as efficient hardware accelerators for sequence alignment, and their performance is better than any alternative available on commodity hardware platforms. Expand
High-throughput sequence alignment using Graphics Processing Units
TLDR
MUMmerGPU is a low cost, ultra-fast sequence alignment program designed to handle the increasing volume of data produced by new, high-throughput sequencing technologies, and demonstrates that even memory-intensive applications can run significantly faster on the relatively low-cost GPU than on the CPU. Expand
A Deterministic Finite Automaton for Faster Protein Hit Detection in BLAST
TLDR
This work presents an optimization to the first stage of the BLAST algorithm specifically designed for protein search that produces the same results as NCBI-BLAST but in around 59% of the time on Intel-based platforms; it also presents results for other popular architectures. Expand
ScalaBLAST: A Scalable Implementation of BLAST for High-Performance Data-Intensive Bioinformatics Analysis
TLDR
ScalaBLAST is developed, which accommodates very large databases and which scales linearly to as many as thousands of processors on both distributed memory and shared memory architectures, representing a substantial improvement over the current state-of-the-art in high-performance sequence alignment with scaling and portability. Expand
Speeding up subset seed algorithm for intensive protein sequence comparison
  • Van Hoa Nguyen, D. Lavenier
  • Computer Science
  • 2008 IEEE International Conference on Research, Innovation and Vision for the Future in Computing and Communication Technologies
  • 2008
TLDR
A subset seed algorithm for intensive protein sequence comparison is presented by using indexing technique and fine grained parallelism of GPU and SIMD instructions for speeding up the treatment of this task. Expand
Optimizing data intensive GPGPU computations for DNA sequence alignment
TLDR
This work exhaustively examined 128 GPU data layout configurations to improve register footprint and running time and conclude higher occupancy has greater impact than reduced latency. Expand
BLAST+: architecture and applications
TLDR
The new BLAST command-line applications, compared to the current BLAST tools, demonstrate substantial speed improvements for long queries as well as chromosome length database sequences. Expand
Accelerating BLASTP on the Cell Broadband Engine
TLDR
This paper demonstrates how the PlayStation®3, powered by the Cell Broadband Engine, can be used as an efficient computational platform to accelerate the popular BLASTP algorithm. Expand
...
1
2
3
4
...