Replicating the Performance Evaluation of an N-Body Application on a Manycore Accelerator

Abstract

Reproducibility for High Performance Computing (HPC) systems has been discussed for some time already, but more work should be carried out to cover the latest accelerators that equip the fastest supercomputers such as the ones listed in Top500. In this paper, we perform a replication of a performance evaluation carried out using an N-Body Open MP parallel application on a XeonPhi accelerator. We also compare the obtained performance with a similar N-Body CUDA application. Besides encountering intriguing results about the Xeon Phi on the number of hardware threads, our comparison against Nvidia boards using the same load shows that the execution Xeon Phi is slower than on Nvidia K20 and GTX760 accelerators.

10 Figures and Tables

Cite this paper

@article{Pinto2015ReplicatingTP, title={Replicating the Performance Evaluation of an N-Body Application on a Manycore Accelerator}, author={Vinicius Garcia Pinto and Vinicius Alves Herbstrith and Lucas Mello Schnorr}, journal={2015 International Symposium on Computer Architecture and High Performance Computing Workshop (SBAC-PADW)}, year={2015}, pages={19-24} }