Emerging general-purpose Graphics Processing Units (GPUs) provide a multi-core platform for a wide range of applications, including machine learning algorithms. In this paper, we propose several techniques to accelerate Support Vector Machines (SVMs) on GPUs. A sparse matrix format is introduced into the parallel SVM to achieve better performance. Experimental results show ...
Motion estimation plays an important role in many applications. This work studies a shared-memory-optimized implementation of motion estimation in which bank conflicts are minimized and bank occupancy is maximized. Experimental results show that a speedup of up to 38 times can be achieved compared with a 3 GHz CPU. Moreover, based on the optimization ...