A Multi-GPU Implementation of a D2Q37 Lattice Boltzmann Code

  title={A Multi-GPU Implementation of a D2Q37 Lattice Boltzmann Code},
  author={Luca Biferale and Filippo Mantovani and Marcello Pivanti and Fabio Pozzati and Mauro Sbragaglia and Andrea Scagliarini and Sebastiano Fabio Schifano and Federico Toschi and Raffaele Tripiccione},
We describe a parallel implementation of a compressible Lattice Boltzmann code on a multi-GPU cluster based on Nvidia Fermi processors. We analyze how to optimize the algorithm for GP-GPU architectures, describe the implementation choices that we have adopted and compare our performance results with an implementation optimized for latest generation multi-core CPUs. Our program runs at ≈ 30% of the double-precision peak performance of one GPU and shows almost linear scaling when run on the multi… CONTINUE READING
8 Citations
11 References
Similar Papers


Publications citing this paper.
Showing 1-8 of 8 extracted citations


Publications referenced by this paper.
Showing 1-10 of 11 references

Performance evaluation of a parallel sparse lattice Boltzmann solver

  • L Axner
  • Journal of Computational Physics 227(10), 4895…
  • 2008
Highly Influential
7 Excerpts

Speeding up a Lattice Boltzmann Kernel on nVIDIAGPUs

  • J. Habich, T. Zeiser, G. Hager, G. Wellein
  • Proc. of PARENG09-S01, Pecs, Hungary
  • 2009
1 Excerpt

The Lattice Boltzmann Equation for Fluid Dynamics and Beyond

  • S. Succi
  • Oxford University Press
  • 2001
1 Excerpt

Similar Papers

Loading similar papers…