Acceleration of three-dimensional Tokamak magnetohydrodynamical code with graphics processing unit and OpenACC heterogeneous parallel programming

  title={Acceleration of three-dimensional Tokamak magnetohydrodynamical code with graphics processing unit and OpenACC heterogeneous parallel programming},
  author={H. W. Zhang and J. Zhu and Z. W. Ma and Guang Yuan Kan and X. Wang and W. Zhang},
  journal={International Journal of Computational Fluid Dynamics},
  pages={393 - 406}
  • H. W. ZhangJ. Zhu W. Zhang
  • Published 3 December 2018
  • Computer Science, Physics
  • International Journal of Computational Fluid Dynamics
ABSTRACT In this paper, the OpenACC heterogeneous parallel programming model is successfully applied to modification and acceleration of the three-dimensional Tokamak magnetohydrodynamical code (CLT). Through combination of OpenACC and MPI technologies, CLT is further parallelised by using multiple-GPUs. Significant speedup ratios are achieved on NVIDIA TITAN Xp and TITAN V GPUs, respectively, with very few modifications of CLT. Furthermore, the validity of the double precision calculations on… 

GPU-Accelerated Computations for Supersonic Flow Modeling on Hybrid Grids

This paper aims to design an extremely efficient double-precision GPU-accelerated parallel algorithm for supersonic flow computations on hybrid grids for computational fluid dynamics (CFD).

Three types of pressure crash in the low magnetic shear tokamaks

Numerical investigations on the pressure crash with a low magnetic shear profile in Tokamaks are carried out through the three-dimensional, toroidal, and nonlinear MHD code CLT. We find that there

Penetration properties of resonant magnetic perturbation in EAST Tokamak

Our three-dimensional toroidal magnetohydrodynamics code [CLT (Ci-Liu-Ti, which means magnetohydrodynamics in Chinese)] is upgraded to CLTx in which the simulation domain extends to a scrape-off

Influence of Hall effect and toroidal flow on the plasmoid formation and incomplete reconnection in a low resistivity plasma in tokamak

The nonlinear resistive-kink mode in the low resistivity plasma in tokamak is investigated through the three-dimensional, toroidal, and nonlinear Hall-MHD code CLT. It is found that, without the

Role of Hall effect on the resistive kink mode in tokamaks

The influence of the Hall effect on the nonlinear evolution of the m/n = 1/1 resistive-kink mode is numerically investigated by the three-dimensional toroidal Hall-MHD code CLT. It is found that the

Systematic simulation studies on the penetration of resonant magnetic perturbations in an Experimental Advanced Superconducting Tokamak

The penetration properties of the n = 1 resonant magnetic perturbations (RMPs) with toroidal rotation are systematically studied by the upgraded three-dimensional toroidal magnetohydrodynamic code

Sawtooth relaxation oscillations, nonlinear helical flows and steady-state m/n=1 magnetic islands in low-viscosity tokamak plasma simulations

A numerical study on the influence of plasma viscosity and of the plasma β (=kinetic pressure/magnetic pressure) parameter on the nonlinear evolution of resistive internal kink modes in tokamak

Influence of aspect ratio, plasma viscosity, and radial position of the resonant surfaces on the plasmoid formation in the low resistivity plasma in Tokamak

In the present paper, we systematically investigate the nonlinear evolution of the resistive kink mode in the low resistivity plasma in Tokamak geometry. We find that the aspect ratio of the initial

The off-axis pressure crash associated with the nonlinear evolution of the m/n = 2/1 double tearing mode

The nonlinear evolution of the m/n = 2/1 double tearing mode (DTM) is investigated by the toroidal resistive magnetohydrodynamic code CLT. It is found that the m/n = 2/1 DTM can lead to either a core



Three Dimensional Pseudo-Spectral Compressible Magnetohydrodynamic GPU Code for Astrophysical Plasma Simulation

The experimental results indicate that the GPU accelerated code is able to achieve upto two orders of magnitude speedup over a corresponding OpenMP parallel, FFTW library based code, on a NVIDIA Tesla P100 GPU.

An MPI/OpenACC implementation of a high-order electromagnetics solver with GPUDirect communication

Performance results and an analysis of a message passing interface/OpenACC implementation of an electromagnetic solver based on a spectral-element discontinuous Galerkin discretization of the time-dependent Maxwell equations show more than 2.5× speedup over central processing unit-only performance on the same number of nodes.

Acceleration of a Particle-in-Cell Code for Space Plasma Simulations with OpenACC

A case study of using OpenACC to offload the computation intensive parts: particle mover and interpolation of particles to grids, in a massively parallel Particle-in-Cell simulation code, iPIC3D, to multi-GPU systems.

GPU Acceleration of an Established Solar MHD Code using OpenACC

It is found that the GPU-accelerated MAS code has the ability to run “zero-beta” simulations on a single multi-GPU server at speeds previously requiring multiple CPU server-nodes of a supercomputer.

Accelerating a C++ CFD Code with OpenACC

It is shown how the texture cache on NVIDIA GPUs can be used to minimize the performance impact of suboptimal memory access patterns on accelerators without writing platform specific code.

OpenACC acceleration of the Nek5000 spectral element code

A case study of porting NekBone, a skeleton version of the Nek5000 code, to a parallel GPU-accelerated system, and profiling of NekBone provided an assessment of the suitability of the code for GPU systems, and indicated possible kernel optimizations.

A heterogeneous computing accelerated SCE-UA global optimization method using OpenMP, OpenCL, CUDA, and OpenACC.

Comparison results indicate the parallel SCE-UA significantly improves computational efficiency compared to the original serial version and the OpenCL implementation obtains the best overall acceleration results however, with the most complex source code.

Hall effect on tearing mode instabilities in tokamak

The tearing mode instability is one of the most important dynamic processes in space and laboratory plasmas. Hall effects, resulting from the decoupling of electron and ion motions, can cause fast

A multi-core CPU and many-core GPU based fast parallel shuffled complex evolution global optimization approach

The recently emerging heterogeneous parallel computing (parallel computing by using the multi-core CPU and many-core GPU) was applied in the parallelization and acceleration of the SCE-UA to test the performance based on the Griewank benchmark function.