High performance comparison-based sorting algorithm on many-core GPUs

Authors: Xiaochun Ye, Dongrui Fan, Wei Ju Lin, Nan Yuan, Paolo Ienne
Venue: 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS)

Sorting is a kernel algorithm for a wide range of applications. We present a new algorithm, GPU-Warpsort, to perform a comparison-based parallel sort on Graphics Processing Units (GPUs). It mainly consists of a bitonic sort followed by a merge sort. Our algorithm achieves high performance by efficiently mapping the sorting tasks to GPU architectures. First, we take advantage of the synchronous execution of threads in a warp to eliminate the barriers in the bitonic sorting network. We also provide…
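
The two-phase structure the abstract describes (a bitonic sort followed by a merge sort) can be illustrated with a minimal sequential sketch. This is not the authors' GPU-Warpsort implementation: it runs on the CPU, the function names `bitonic_sort_chunk` and `chunked_bitonic_then_merge` are hypothetical, and the chunk size of 32 is an assumption chosen only because it matches the warp width on NVIDIA GPUs. On a GPU, the compare-exchange steps inside one chunk would be carried out by the threads of a single warp, which is where the abstract's point about avoiding barriers applies.

```python
from heapq import merge


def bitonic_sort_chunk(values):
    """Sort a power-of-two-length list with a bitonic sorting network."""
    a = list(values)
    n = len(a)
    assert n > 0 and n & (n - 1) == 0, "length must be a power of two"
    k = 2
    while k <= n:          # size of the bitonic sequences being merged
        j = k // 2
        while j >= 1:      # compare-exchange distance within this stage
            for i in range(n):
                partner = i ^ j
                if partner > i:
                    ascending = (i & k) == 0
                    # Swap when the pair is out of order for its direction.
                    if (a[i] > a[partner]) == ascending:
                        a[i], a[partner] = a[partner], a[i]
            j //= 2
        k *= 2
    return a


def chunked_bitonic_then_merge(data, chunk=32):
    """Phase 1: bitonic-sort fixed-size chunks. Phase 2: merge sorted runs.

    Assumes numeric input; the tail is padded with +inf sentinels that are
    dropped again at the end.
    """
    pad = (-len(data)) % chunk
    padded = list(data) + [float("inf")] * pad
    runs = [
        bitonic_sort_chunk(padded[i:i + chunk])
        for i in range(0, len(padded), chunk)
    ]
    while len(runs) > 1:   # pairwise merging, as in a bottom-up merge sort
        runs = [list(merge(*runs[i:i + 2])) for i in range(0, len(runs), 2)]
    result = runs[0] if runs else []
    return result[:len(data)]
```

Calling `chunked_bitonic_then_merge(values)` on a list of numbers returns the same ordering as `sorted(values)`. The bitonic network is used for the small fixed-size pieces because its compare-exchange pattern does not depend on the data, while merging handles the variable number of sorted runs.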

