Ahmed Khawaja

We don’t have enough information about this author to calculate their statistics. If you think this is an error let us know.
Learn More
In this paper, we demonstrate how new technology from Oracle can be utilized to provide big data analytics acceleration in a streamlined fashion. Specifically, our approach leverages the acceleration capabilities of the Data Analytics Accelerator (DAX) unit provided by Oracle's T7/M7/S7 SPARC processors and the Java Stream API to seamlessly accelerate Java(More)
Hardware vendors have announced support for on-die FPGAs in future server-class processors, and providers are touting support for on-demand FPGA acceleration in the cloud. However, OSes have not yet responded with first-class support for FPGAs. This paper proposes a design for an FPGA OS support called XENOS. XENOS provides abstractions that allow multiple,(More)
Adaptive mesh refinement (AMR) numerical methods utilizing octree data structures are an important class of HPC applications, in particular the solution of partial differential equations. Much effort goes into the implementation of efficient versions of these types of programs, where the emphasis is often on increasing multi-node performance when utilizing(More)
Kernel summation is a widely used computational kernel that involves matrix-matrix multiplication (GEMM) and matrix-vector multiplication (GEMV) computational primitives. The parallelism exhibited in kernel summation suggests performance improvement when running on GPGPU. State of the art GPU solutions apply cuBLAS library but cannot exploit much of the(More)
  • 1