Zurich ̈ Technische Hochschule

Abstract

We present PARDISO, a new scalable parallel sparse direct linear solver on shared memory multiprocessors. In this paper, we describe the parallel fac-torization algorithm which utilizes the supernode structure of the matrix to reduce the number of memory references with Level 3 BLAS. We also propose enhancements that signiicantly reduce the communication rate for pipelining parallelism. The result is a greatly increased factorization performance. Furthermore , we have investigated popular shared memory multiprocessors and the most popular numerical algorithms commonly used for the solution of the classical drift-diiusion and the diiusion-reaction equations in semiconductor device and process simulation. The study includes a preconditioned iterative linear solver package and our parallel direct linear solver. Moreover, we have investigated the eeciency and the limits of our parallel approach. Results of several simulations of up to 300'000 unknowns for three-dimensional simulations are presented to illustrate our approach towards robust, parallel semiconductor device and process simulation. Abstract. We present PARDISO, a new scalable parallel sparse direct linear solver on shared memory multiprocessors. In this paper, we describe the parallel factorization algorithm which utilizes the supernode structure of the matrix to reduce the number of memory references with Level 3 BLAS. We also propose enhancements that signiicantly reduce the communication rate for pipelining parallelism. The result is a greatly increased factorization performance. Furthermore, we have investigated popular shared memory multiprocessors and the most popular numerical algorithms commonly used for the solution of the classical drift-diiusion and the diiusion-reaction equations in semiconductor device and process simulation. The study includes a preconditioned iterative linear solver package and our parallel direct linear solver. Moreover, we have investigated the eeciency and the limits of our parallel approach. Results of several simulations of up to 300'000 unknowns for three-dimensional simulations are presented to illustrate our approach towards robust, parallel semiconductor device and process simulation.

Extracted Key Phrases

12 Figures and Tables

Cite this paper

@inproceedings{Schenk1999ZurichT, title={Zurich ̈ Technische Hochschule}, author={Olaf Schenk and Wolfgang Fichtner}, year={1999} }