Corpus ID: 249494364

Seamless GPU acceleration for C++ based physics with the Metal Shading Language on Apple's M series unified chips

@inproceedings{Gebraad2022SeamlessGA,
  title={Seamless GPU acceleration for C++ based physics with the Metal Shading Language on Apple's M series unified chips},
  author={Lars Gebraad and Andreas Fichtner},
  year={2022}
}
The M series of chips produced by Apple has proven a capable and power-efficient alternative to mainstream Intel and AMD x86 processors for everyday tasks. Additionally, the unified design, which integrates the central processing unit and the graphics processing unit (GPU), has allowed these M series chips to excel at many tasks with heavy graphical requirements without the need for a discrete GPU, in some cases even outperforming discrete GPUs. In this work, we show how the M… 

References

Memory access coalescing: a technique for eliminating redundant memory accesses

TLDR
A general code improvement algorithm that transforms code to better exploit the available memory bandwidth on existing microprocessors as well as wide-bus machines, and the effectiveness of the transformation varied significantly with respect to the instruction-set architecture of the tested platform.

Parallel Realization of the Element-by-Element FEM Technique by CUDA

TLDR
It is shown that this low-storage but computation-intensive technique is better suited for GPUs than those requiring the massive manipulation of large data sets.

Global adjoint tomography—model GLAD-M25

Building on global adjoint tomography model GLAD-M15, we present transversely isotropic global model GLAD-M25, which is the result of 10 quasi-Newton tomographic iterations with an earthquake

Full-waveform inversion imaging of the human brain

TLDR
In-silico simulations are used to demonstrate that full-waveform inversion, a computational technique originally developed in geophysics, is able to generate accurate three-dimensional images of the brain with sub-millimetre resolution.

Acoustoelastic full-waveform inversion for transcranial ultrasound computed tomography

Full-waveform inversion applied to ultrasound computed tomography is a promising technique to provide high-resolution quantitative images of soft human tissues, which are otherwise difficult to

An overview of full-waveform inversion in exploration geophysics

TLDR
This review attempts to illuminate the state of the art of FWI by building accurate starting models with automatic procedures and/or recording low frequencies, and improving computational efficiency by data-compression techniques to make 3D elastic FWI feasible.

psvWave: elastic wave propagation in 2d for Python and C++

We present 'psvWave', a basic numerical finite difference solver for Python and C++, specifically targeted at seismologists. The solver is based on the well-established staggered grid approaches

P-SV wave propagation in heterogeneous media: Velocity‐stress finite‐difference method

I present a finite-difference method for modeling P-SV wave propagation in heterogeneous media. This is an extension of the method I previously proposed for modeling SH-wave propagation by using