Cache miss characterization and data locality optimization for imperfectly nested loops on shared memory multiprocessors

@article{Sahoo2005CacheMC,
  title={Cache miss characterization and data locality optimization for imperfectly nested loops on shared memory multiprocessors},
  author={S. Sahoo and Rajkiran Panuganti and S. Krishnamoorthy and P. Sadayappan},
  journal={19th IEEE International Parallel and Distributed Processing Symposium},
  year={2005},
  pages={10 pp.-}
}
This paper develops an algorithm to accurately characterize the number of cache misses for a class of compute-intensive calculations encountered in accurate quantum chemistry models of electronic structure. The proposed approach can handle imperfectly nested loop structures, symbolic loop bounds, and non-constant dependences for a constrained class of array references. It is proposed in the context of tensor contraction computations, and extends previous work on "stack distances" by Almasi et… Expand
Integrated Loop Optimizations for Data Locality Enhancement of Tensor Contraction Expressions
A Scalable Analytical Memory Model for CPU Performance Prediction
High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation

References

SHOWING 1-10 OF 24 REFERENCES
Improving data locality with loop transformations
Data and computation transformations for multiprocessors
Estimating cache misses and locality using stack distances
Locality Enhancement Of Imperfectly-nested Loop Nests
Data-centric multi-level blocking
Cache miss equations: a compiler framework for analyzing and tuning memory behavior
On Optimizing a Class of Multi-Dimensional Loops with Reductions for Parallel Execution
Precise miss analysis for program transformations with caches of arbitrary associativity
...
1
2
3
...