Implementation of Digital Signal Processing Algorithm in General Purpose Graphics Processing Unit (GPGPU)

In this paper, we propose sequential and parallel implementations of matrix and matrix-vector multiplication using Compute Unified Device Architecture (CUDA) libraries. We show how a class of algorithms used in digital signal processing can be parallelized. We illustrate this approach with Linear Convolution, Circular Convolution, and Least…
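
To make the link between convolution and matrix-vector multiplication concrete, the following is a minimal sequential sketch in Python. It assumes the common formulation in which linear convolution is a product with a Toeplitz matrix and circular convolution is a product with a circulant matrix; the abstract does not state that the paper uses exactly this construction. All function names are hypothetical and chosen for illustration, and this is plain Python rather than CUDA code.

```python
def matvec(matrix, vector):
    """Sequential matrix-vector product.

    Each output element depends on one row only, so the rows could be
    computed independently (for example, one GPU thread per row).
    """
    return [sum(a * b for a, b in zip(row, vector)) for row in matrix]


def linear_convolution_matrix(h, n):
    """Build the (len(h) + n - 1) x n Toeplitz matrix T with T[i][j] = h[i - j]."""
    rows = len(h) + n - 1
    return [
        [h[i - j] if 0 <= i - j < len(h) else 0 for j in range(n)]
        for i in range(rows)
    ]


def circular_convolution_matrix(h, n):
    """Build the n x n circulant matrix C with C[i][j] = h[(i - j) mod n].

    Assumes len(h) <= n; h is zero-padded to length n.
    """
    padded = list(h) + [0] * (n - len(h))
    return [[padded[(i - j) % n] for j in range(n)] for i in range(n)]


def linear_convolution(h, x):
    """Linear convolution of h and x as a Toeplitz matrix-vector product."""
    return matvec(linear_convolution_matrix(h, len(x)), x)


def circular_convolution(h, x):
    """Circular convolution of h and x (period len(x)) as a circulant product."""
    return matvec(circular_convolution_matrix(h, len(x)), x)
```

For example, `linear_convolution([1, 2], [1, 2, 3, 4])` gives `[1, 4, 7, 10, 8]`, while `circular_convolution([1, 2], [1, 2, 3, 4])` gives `[9, 4, 7, 10]`, because the final linear term wraps around onto the first output. Since every row of the product is independent, the same structure would map naturally onto a parallel matrix-vector kernel.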