• Corpus ID: 15056944

Rapid Performance of a Generalized Distance Calculation

@inproceedings{Fisackerly2011RapidPO,
  title={Rapid Performance of a Generalized Distance Calculation},
  author={Scott Fisackerly and E. Chu and D. L. Foster},
  year={2011}
}
The ever-increasing size of data sets and the need for real-time processing drives the need for high speed analysis. Since traditional CPUs are designed to execute a small number of sequential process, they are ill-suited to keep pace with this growth and exploit the massive parallelism inherent in these problem spaces. In the last several years, the parallelism of GPUs has made them a viable solution for general purpose computing. However, effective use of GPUs requires a significantly… 

References

SHOWING 1-8 OF 8 REFERENCES
Clustering billions of data points using GPUs
TLDR
This paper investigates if GPUs can be useful accelerators even with very large data sets that cannot fit into GPU's onboard memory, and uses a popular clustering algorithm, K-Means, as an example.
K-Means on Commodity GPUs with CUDA
TLDR
In this algorithm, in order to accelerate compute-intensive portions of traditional k-means, both data objects assignment and k centroids recalculation are offloaded to the GPU in parallel.
Programming Massively Parallel Processors. A Hands-on Approach
  • Jie Cheng
  • Computer Science
    Scalable Comput. Pract. Exp.
  • 2010
TLDR
This comprehensive test/reference provides a foundation for the understanding and implementation of parallel programming skills which are needed to achieve breakthrough results by developing parallel applications that perform well on certain classes of Graphic Processor Units (GPUs).
Hierarchical Agglomerative Clustering Using Graphics Processor with Compute Unified Device Architecture
TLDR
This paper has used CUDA to implement the complete hierarchical agglomerative clustering algorithm and shows almost double the speed gain using much cheaper desk top graphics card.
CUDA by Example: An Introduction to General-Purpose GPU Programming
  • Jie Cheng
  • Computer Science
    Scalable Comput. Pract. Exp.
  • 2010
TLDR
This book is designed for readers who are interested in studying how to develop general parallel applications on graphics processing unit (GPU) by using CUDA C, a programming language which combines industry standard programming C language and some more features which can exploit CUDA architecture.
High-Performance Computing -Supercomputing with Tesla GPUs
  • High-Performance Computing -Supercomputing with Tesla GPUs
  • 2011
What is CUDA? Available
  • What is CUDA? Available
  • 2011
Optimizing Matrix Transpose in CUDA Available: http://developer.download.nvidia.com/compute/cuda/sdk/we bsite/C/src/transposeNew/doc/MatrixTranspose
  • Tesla C1060 Computing Processor Board. Available: http://www.nvidia.com/docs/IO/43395/BD-04111- 001_v05.pdf Figure 7 Average running times in seconds on
  • 2009