The calculation of long range interactions is a computa-tionally demanding task in particle based simulations. To reduce the computational complexity from O(N 2) to O(N) an algorithm based on a fast 2d-Wavelet transform technique was developed. In this algorithm, a CPU and memory demanding part is to construct the 2d-Wavelet transform of a grid based… (More)
The Cell Superscalar framework (CellSs), from Barcelona Supercomputing Centre, offers a high-level portable programming model to port, parallelise and tune applications on Cell Broadband Engine. Via implementation of a Jacobi solver and a triple-matrix-multiply (TMM) kernel from a wavelet-based evaluation of Coulomb potentials in molecular systems, using… (More)
A QS20 Memory Management 40 i ii CONTENTS B SUMMA Algorithm 43 Introduction In the near future computer platforms will be based on multicore processors which will concentrate dozens or even hundreds of cores on a chip. Hence the number of cores in a high performance cluster will increase to hundreds of thousands. As a result, sequential and parallel… (More)
Permission to make digital or hard copies of portions of this work for personal or classroom use is granted provided that the copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise requires prior specific permission by the publisher mentioned above.