Rodrigo Dominguez

Loop vectorization, a key feature exploited to obtain high performance on Single Instruction Multiple Data (SIMD) vector architectures, is significantly hindered by irregular memory access patterns in the data stream. This paper describes data transformations that allow us to vectorize loops targeting massively multithreaded data parallel architectures. We ...
Extensive research efforts have been devoted to the feasibility of picture archiving and communication systems (PACS) in recent years. The advantages of PACS are numerous but mainly include reduced cost and improvement in the operational efficiency of a PACS-based radiology department. In digital radiography, images are viewed either in hard-copy or ...
Graphics Processing Units (GPU) have become the platform of choice for accelerating a large range of data parallel and task parallel applications. Both AMD and NVIDIA have developed GPU implementations targeted at the high performance computing market. The rapid adoption of GPU computing has been greatly aided by the introduction of high-level programming ...
The current trend toward heterogeneous architectures motivates us to reconsider current software and hardware paradigms. The focus is centered around new parallel programming models, compiler design, and runtime resource management techniques to exploit the features of many-core processor architectures. Graphics Processing Units (GPU) have become the ...
In this paper, a scheduling scheme called remote queueing multiple access-code division multiple access (RQMA-CDMA) is proposed. This scheduling scheme provides flexibility in bandwidth allocation and QoS. In addition, admission control guarantees the required E_b/N_o for multimedia traffic. When comparing the proposed RQMA-CDMA scheme to the ...