Multiprocessing GPU Acceleration of H.264/AVC Motion Estimation under CUDA Architecture

Abstract

Abstract— This work presents a parallel GPU-based solution for the Motion Estimation (ME) process in a video encoding system. We propose a way to partition the steps of Full Search block matching algorithm in the CUDA architecture, and to compare the performance with a theoretical model and two implementations (sequential and parallel using OpenMP library). We obtained a O(n2/log2n) speed-up which fits the theoretical model considering different search areas. It represents up to 600x gain compared to the serial implementation, and 66x compared to the parallel OpenMP implementation.

2 Figures and Tables

Cite this paper

@inproceedings{Monteiro2011MultiprocessingGA, title={Multiprocessing GPU Acceleration of H.264/AVC Motion Estimation under CUDA Architecture}, author={Eduarda Monteiro and Bruno Boessio Vizzotto and Cl{\'a}udio Machado Diniz and Bruno Zatt and Sergio Bampi}, year={2011} }