Octo-Tiger’s New Hydro Module and Performance Using HPX+CUDA on ORNL’s Summit

@article{Diehl2021OctoTigersNH,
  title={Octo-Tiger’s New Hydro Module and Performance Using HPX+CUDA on ORNL’s Summit},
  author={Patrick Diehl and Gregor Dai{\ss} and Dominic Marcello and Kevin A. Huck and Sagiv Shiber and Hartmut Kaiser and Juhan Frank and Dirk Pfl{\"u}ger},
  journal={2021 IEEE International Conference on Cluster Computing (CLUSTER)},
  year={2021},
  pages={204-214}
}
Octo-Tiger is a code for modeling three-dimensional self-gravitating astrophysical fluids. It was particularly designed for the study of dynamical mass transfer between interacting binary stars. Octo-Tiger is parallelized for distributed systems using the asynchronous many-task runtime system, the C++ standard library for parallelism and concurrency (HPX) and utilizes CUDA for its gravity solver. Recently, we have remodeled Octo-Tiger’s hydro solver to use a three-dimensional reconstruction… 

Figures and Tables from this paper

References

SHOWING 1-10 OF 37 REFERENCES

Accelerating Octo-Tiger: Stellar Mergers on Intel Knights Landing with HPX

This paper studies the node-level performance of Octo-Tiger's fast multipole method, and achieves up to 408 GFLOPS, resulting in a speedup of 2x compared to a 24-core Skylake-SP platform, using the same high-level abstractions.

octo-tiger: a new, 3D hydrodynamic code for stellar mergers that uses hpx parallelization

OCTO-TIGER is an astrophysics code to simulate the evolution of self-gravitating and rotat-ing systems of arbitrary geometry based on the fast multipole method, using adaptive mesh refinement.

From piz daint to the stars: simulation of stellar mergers using high-level abstractions

Octo-Tiger, a finite volume grid-based hydrodynamics simulation code with Adaptive Mesh Refinement which is unique in conserving both linear and angular momentum to machine precision, is developed and extended to heterogeneous GPU-accelerated supercomputers, demonstrating node-level performance and portability.

Harnessing billions of tasks for a scalable portable hydrodynamic simulation of the merger of two stars

We present a highly scalable demonstration of a portable asynchronous many-task programming model and runtime system applied to a grid-based adaptive mesh refinement hydrodynamic simulation of a

Performance Measurements Within Asynchronous Task-Based Runtime Systems: A Double White Dwarf Merger as an Application

This work added HPX application-specific performance counters to the Octo-Tiger full 3-D adaptive multigrid code astrophysics application, enabling the combined visualization of physical and performance data to highlight bottlenecks with respect to different solvers.

K-Athena: A Performance Portable Structured Grid Finite Volume Magnetohydrodynamics Code

This work combines an existing magnetohydrodynamics CPU code with a performance portable on-node parallel programming paradigm into K-Athena to allow efficient simulations on multiple architectures using a single codebase and presents profiling and scaling results for different platforms.

Implementation of Peridynamics utilizing HPX - the C++ standard library for parallelism and concurrency

This paper presents a peridynamics EMU nodal discretization implementation with the C++ Standard Library for Concurrency and Parallelism (HPX), an open source asynchronous many task run time system.

A TWO-MOMENT RADIATION HYDRODYNAMICS MODULE IN ATHENA USING A TIME-EXPLICIT GODUNOV METHOD

We describe a module for the Athena code that solves the gray equations of radiation hydrodynamics (RHD), based on the first two moments of the radiative transfer equation. We use a combination of

Enzo-P / Cello: scalable adaptive mesh refinement for astrophysics and cosmology

The known scaling issues in Enzo are elaborate on, and how the Cello framework is to address these and other scaling issues through a combination of existing and novel approaches.

Massively parallel cosmological simulations with ChaNGa

ChaNGa, a production simulator based on the CHARM++ infrastructure, is presented and the multistepping capabilities of ChaNGa are presented, as are solutions to the load imbalance that such multiphase simulations face.