Rodrigo Dominguez

Loop vectorization, a key feature exploited to obtain high performance on Single Instruction Multiple Data (SIMD) vector architectures, is significantly hindered by irregular memory access patterns in the data stream. This paper describes data transformations that allow us to vectorize loops targeting massively multithreaded data parallel architectures. We…
Graphics Processing Units (GPUs) have become the platform of choice for accelerating a wide range of data parallel and task parallel applications. Both AMD and NVIDIA have developed GPU implementations targeted at the high performance computing market. The rapid adoption of GPU computing has been greatly aided by the introduction of high-level programming…
Extensive research efforts have been devoted to the feasibility of picture archiving and communication systems (PACS) in recent years. The advantages of PACS are numerous but mainly include reduced cost and improvement in the operational efficiency of a PACS-based radiology department. In digital radiography, images are viewed either in hard-copy or…