Honing and proofing Astrophysical codes on the road to Exascale. Experiences from code modernization on many-core systems

  title={Honing and proofing Astrophysical codes on the road to Exascale. Experiences from code modernization on many-core systems},
  author={Salvatore Cielo and Luigi Iapichino and Fabio Baruffa and Matteo Bugli and Christoph Federrath},
  journal={Future Gener. Comput. Syst.},
2 Citations
Mansard Roofline Model: Reinforcing the Accuracy of the Roofs
The Mansard Roofline Model (MaRM), proposed in this work, uncovers a minimum set of architectural features that must be considered to provide insightful, but yet accurate and realistic, modeling of performance upper bounds for modern processors.


Performance Optimisation of Smoothed Particle Hydrodynamics Algorithms for Multi/Many-Core Architectures
A strategy for code modernisation of Gadget, a widely used community code for computational astrophysics, targeting current multi/many-core Intel® architectures for node-level performance optimisation, results in shorter execution time and improved threading scalability.
WOMBAT: A Scalable and High-performance Astrophysical Magnetohydrodynamics Code
A novel hybrid OpenMP/MPI programming model that emerged from a collaboration between Cray, Inc. and the University of Minnesota is described, which allows the code to run extremely efficiently at very high thread counts ideal for the latest generation of multi-core and many-core architectures.
Speeding simulation analysis up with yt and Intel Distribution for Python
A brief tutorial for the installation of yt and the Intel Distribution for Python, and the execution of each analysis task is provided, to achieve net speedups up to 4.6x on Intel Xeon Scalable processors (codename Skylake).
Energy Efficiency Features of the Intel Skylake-SP Processor and Their Impact on Performance
The effects of hardware controlled energy efficiency features for the Intel Skylake-SP processor, which includes C-state latencies increased significantly over the Haswell-EP processor generation, and the out-of-order throttling for workloads using 512 bit wide vectors.
FLASH: An Adaptive Mesh Hydrodynamics Code for Modeling Astrophysical Thermonuclear Flashes
The first version of a new-generation simulation code, FLASH, solves the fully compressible, reactive hydrodynamic equations and allows for the use of adaptive mesh refinement and contains state-of-the-art modules for the equations of state and thermonuclear reaction networks.
SWIFT: Using Task-Based Parallelism, Fully Asynchronous Communication, and Graph Partition-Based Domain Decomposition for Strong Scaling on more than 100,000 Cores
We present a new open-source cosmological code, called SWIFT, designed to solve the equations of hydrodynamics using a particle-based approach (Smooth Particle Hydrodynamics) on hybrid shared /
Yt, an open source, community-developed astrophysical analysis and visualization toolkit, is presented and its methods for reading, handling, and visualizing data, including projections, multivariate volume rendering, multi-dimensional histograms, halo finding, light cone generation, and topologically connected isocontour identification are reported.
Optimization and Parallelization of B-Spline Based Orbital Evaluations in QMC on Multi/Many-Core Shared Memory Processors
This work implements efficient nested threading in B-spline orbital evaluation kernels, paving the way towards enabling strong scaling of QMC simulations and results in greater than 4.5x speedup of miniQMC on KNL.
The Cosmological simulation code GADGET-2
We discuss the cosmological simulation code GADGET-2, a new massively parallel TreeSPH code, capable of following a collisionless fluid with the N-body method, and an ideal gas by means of smoothed
Advancing the performance of astrophysics simulations with ECHO-3DHPC
  • ©Intel Parallel Universe Mag.
  • 2018