Blocking LU Decomposition for FPGAs

  title={Blocking LU Decomposition for FPGAs},
  author={Guiming Wu and Yong Dou and Gregory D. Peterson},
  journal={2010 18th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines},
To efficiently perform large matrix LU decomposition on FPGAs with limited local memory, the original algorithm needs to be blocked. In this paper, we propose a block LU decomposition algorithm for FPGAs, which is applicable for matrices of arbitrary size. We introduce a high performance hardware design, which mainly consists of a linear array of processing elements (PEs), to implement our block LU decomposition algorithm. A total of 36 PEs can be integrated into a Xilinx Virtex-5 xc5vlx330… CONTINUE READING


Publications referenced by this paper.
Showing 1-10 of 11 references

Exploring Accelerating Science Applications with FPGAs

  • O. Storaasli, D. Strenski
  • RSSI, 2007.
  • 2007
1 Excerpt

LAPACK Users’ Guide

  • E. Anderson, Z. Bai, +8 authors D. Sorensen
  • The Society for Industrial and Applied…
  • 1999
2 Excerpts

Similar Papers

Loading similar papers…