We address in this paper the parallelization of a recursive algorithm for large scale triangular matrix inversion based on the â€˜Divide and Conquerâ€™ (D&C) paradigm. A set of different versions of anâ€¦ (More)

GPU-accelerated computing consists in using a graphics processing unit (GPU) together with a CPU in order to enhance the performance of scientific and engineering applications. With the increasingâ€¦ (More)

â€˜Divide and Conquerâ€™ (D&C) is a famous paradigm for designing efficient algorithms and improving the effectiveness of computer memory hierarchies. Indeed, D&C-based matrix algorithms operate onâ€¦ (More)

2017 International Conference on High Performanceâ€¦

2017

There are several approaches for computing the inverse of a dense square matrix, say A, namely Gaussian elimination, block wise inversion, and LU factorization (LUF). The latter is used inâ€¦ (More)

We address in this paper the parallelization of a recursive algorithm for triangular matrix inversion (TMI) based on the â€˜Divide and Conquerâ€™ (D&C) paradigm. A series of different versions of anâ€¦ (More)