Optimization And Profiling Of The Cache Performance Of Parallel Lattice Boltzmann Codes

@article{Pohl2003OptimizationAP,
  title={Optimization And Profiling Of The Cache Performance Of Parallel Lattice Boltzmann Codes},
  author={Thomas Pohl and Markus Kowarschik and Jens Wilke and Klaus Iglberger and Ulrich R{\"u}de},
  journal={Parallel Processing Letters},
  year={2003},
  volume={13},
  pages={549-560}
}
When designing and implementing highly efficient scientific applications for parallel computers such as clusters of workstations, it is inevitable to consider and to optimize the single-CPU performance of the codes. For this purpose, it is particularly important that the codes respect the hierarchical memory designs that computer architects employ in order to hide the effects of the growing gap between CPU performance and main memory speed. In this article, we present techniques to enhance the… CONTINUE READING

Citations

Publications citing this paper.
SHOWING 1-10 OF 72 CITATIONS

Data Layout Transformation Exploiting Memory-Level Parallelism in Structured Grid Many-Core Applications

  • 2010 19th International Conference on Parallel Architectures and Compilation Techniques (PACT)
  • 2010
VIEW 3 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

Accelerating the Parallelization of Lattice Boltzmann Method by Exploiting the Temporal Locality

  • 2017 IEEE International Symposium on Parallel and Distributed Processing with Applications and 2017 IEEE International Conference on Ubiquitous Computing and Communications (ISPA/IUCC)
  • 2017
VIEW 2 EXCERPTS
CITES METHODS

FILTER CITATIONS BY YEAR

2004
2019

CITATION STATISTICS

  • 3 Highly Influenced Citations