Corpus ID: 14014986

Performance Engineering for a Medical Imaging Application on the Intel Xeon Phi Accelerator

  title={Performance Engineering for a Medical Imaging Application on the Intel Xeon Phi Accelerator},
  author={Johannes Hofmann and Jan Treibig and G. Hager and G. Wellein},
  booktitle={ARCS Workshops},
We examine the Xeon Phi, which is based on Intel's Many Integrated Cores architecture, for its suitability to run the FDK algorithm--the most commonly used algorithm to perform the 3D image reconstruction in cone-beam computed tomography. We study the challenges of efficiently parallelizing the application and means to enable sensible data sharing between threads despite the lack of a shared last level cache. Apart from parallelization, SIMD vectorization is critical for good performance on the… Expand
High-performance X-ray tomography reconstruction algorithm based on heterogeneous accelerated computing systems
Performance portable back-projection algorithms on CPUs: agnostic data locality and vectorization optimizations
Cache-Aware GPU Optimization for Out-of-Core Cone Beam CT Reconstruction of High-Resolution Volumes
A survey on evaluating and optimizing performance of Intel Xeon Phi
Modeling Gather and Scatter with Hardware Performance Counters for Xeon Phi
A review of GPU-based medical image reconstruction.
  • P. Després, X. Jia
  • Computer Science, Medicine
  • Physica medica : PM : an international journal devoted to the applications of physics to medicine and biology : official journal of the Italian Association of Biomedical Physics
  • 2017
iFDK: a scalable framework for instant high-resolution image reconstruction
Parallel Image Processing on the Sunway Many-Core Processor
  • Meiting Zhao, Rui Liu, Y. Liu, Kaida Song, D. Qian
  • Computer Science
  • 2016 IEEE 18th International Conference on High Performance Computing and Communications; IEEE 14th International Conference on Smart City; IEEE 2nd International Conference on Data Science and Systems (HPCC/SmartCity/DSS)
  • 2016


Evaluation of state-of-the-art hardware architectures for fast cone-beam CT reconstruction
Systematic Performance Optimization of Cone-Beam Back-Projection on the Kepler Architecture
Technical note: RabbitCT--an open platform for benchmarking 3D cone-beam reconstruction algorithms.
ispc: A SPMD compiler for high-performance CPU programming
GPU computing in medical physics: a review.
Introducing a Performance Model for Bandwidth-Limited Loop Kernels