TESLA GPUs versus MPI with OpenMP for the Forward Modeling of Gravity and Gravity Gradient of Large Prisms Ensemble

  title={TESLA GPUs versus MPI with OpenMP for the Forward Modeling of Gravity and Gravity Gradient of Large Prisms Ensemble},
  author={Carlos Couder-Casta{\~n}eda and Carlos Ortiz-Alem{\'a}n and Mauricio Gabriel Orozco-del-Castillo and Mauricio Nava-Flores},
  journal={J. Appl. Math.},
An implementation with the CUDA technology in a single and in several graphics processing units (GPUs) is presented for the calculation of the forward modeling of gravitational fields from a tridimensional volumetric ensemble composed by unitary prisms of constant density. We compared the performance results obtained with the GPUs against a previous version coded in OpenMP with MPI, and we analyzed the results on both platforms. Today, the use of GPUs represents a breakthrough in parallel… 
A Performance Study of a Dual Xeon-Phi Cluster for the Forward Modelling of Gravitational Fields
This research shows an efficient strategy based on nested parallelism using Open MP, a design that in its outer structure acts as a controller of interconnected Xeon-Phi coprocessors while its interior is used for parallelyzing the loops.
Contributions of hybrid architectures to depth imaging: a CPU, APU and GPU comparative study. (Apports des architectures hybrides à l'imagerie profondeur : étude comparative entre CPU, APU et GPU)
It is shown that APUs deliver performances that range between those of CPUs and those of GPUs, and that the APU can be as power efficient as the GPU.
Full tensor gravity gradiometry data inversion: Performance analysis of parallel computing algorithms
We apply reweighted inversion focusing to full tensor gravity gradiometry data using message-passing interface (MPI) and compute unified device architecture (CUDA) parallel computing algorithms, and
Fast Density Inversion Solution for Full Tensor Gravity Gradiometry Data
The classical preconditioned conjugate gradient method for full tensor gravity gradiometry data is modified and implemented on a cluster to achieve rapid density inversions for various scenarios, overcoming the problems of computation time and memory requirements caused by too many iterations.
Analysis of Electromagnetic Propagation from MHz to THz with a Memory-Optimised CPML-FDTD Algorithm
This paper presents an optimised FDTD-CPML algorithm, focused in saving memory while increasing the performance of the algorithm, and concludes that the implemented CPML ABC constitutes a robust choice in terms of precision and accuracy for the high frequencies herein considered.
OpenMP Implementation of a Novel Potential-Field-Data Source-Growth-Based Inversion Approach for 3D Salt Imaging in Deepwater Gulf of Mexico
Potential-field-data imaging of complex geological features in deepwater salt-tectonic regions in the Gulf of Mexico remains an open active research field. There is still a lack of resolution in
3D Gravity Modeling of Complex Salt Features in the Southern Gulf of Mexico
We present a three-dimensional (3D) gravity modeling and inversion approach and its application to complex geological settings characterized by several allochthonous salt bodies embedded in
3 D Gravity Modeling of Complex Salt Features in the Southern Gulf of Mexico
1Facultad de Ingenieŕıa, Universidad Nacional Autónoma de México (UNAM), Avenida Universidad No. 3000, CU, Coyoacán, 04510 Ciudad de México, DF, Mexico 2Instituto Mexicano del Petróleo, Eje Central


Accelerating a three-dimensional finite-difference wave propagation code using GPU graphics cards
The implementation of the code in CUDA to simulate the propagation of seismic waves in a heterogeneous elastic medium and the code that runs on a graphics card gives the expected results by comparing the results to those obtained by running the same simulation on a classical processor core.
Parallel 3d Finite-Difference Time-Domain Method On Multi-Gpu Systems
This paper presents implementations of three-dimensional FDTD with convolutional perfect match layer (CPML) absorbing boundary conditions on graphics processing unit (GPU) and extends the parallel algorithm to multiple GPUs in order to solve electrically large structures.
Acceleration of computation speed for elastic wave simulation using a Graphic Processing Unit
This study simulates two- and three-dimensional elastic wave propagation using the Finite-Difference Time-Domain (FDTD) method on GPUs using the staggered-grid method, which is one of the conventional FD schemes, since this method can achieve sufficient accuracy for use in numerical modelling in geophysics.
Fast seismic modeling and Reverse Time Migration on a GPU cluster
A fast parallel simulator that solves the acoustic wave equation on a GPU cluster, using CUDA to take advantage of the GPUs computational power and considers a finite difference approach on a regular mesh, in both 2D and 3D cases.
Enabling Mixed OpenMP/MPI Programming on Hybrid CPU/GPU Computing Architecture
  • T. Liang, Hung-Fu Li, Jun-Yao Chiu
  • Computer Science
    2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum
  • 2012
A framework called OMPICUDA is proposed for users to develop parallel applications on the hybrid CPU/GPU clusters by mixing the APIs of OpenMP and MPI and supports resource reallocation based on the states of CPUs and GPUs.
Massively parallel forward modeling of scalar and tensor gravimetry data
A Survey of General-Purpose Computation on Graphics Hardware
The techniques used in mapping general-purpose computation to graphics hardware will be generally useful for researchers who plan to develop the next generation of GPGPU algorithms and techniques.