Hierarchical QR Factorization Algorithms for Multi-core Cluster Systems

@article{Dongarra2012HierarchicalQF,
  title={Hierarchical QR Factorization Algorithms for Multi-core Cluster Systems},
  author={Jack J. Dongarra and Mathieu Faverge and Thomas H{\'e}rault and Julien Langou and Yves Robert},
  journal={2012 IEEE 26th International Parallel and Distributed Processing Symposium},
  year={2012},
  pages={607-618}
}
This paper describes a new QR factorization algorithm which is especially designed for massively parallel platforms combining parallel distributed multi-core nodes. These platforms make the present and the foreseeable future of high-performance computing. Our new QR factorization algorithm falls in the category of the tile algorithms which naturally enables good data locality for the sequential kernels executed by the cores (high sequential performance), low number of messages in a parallel… CONTINUE READING
Highly Cited
This paper has 25 citations. REVIEW CITATIONS

Citations

Publications citing this paper.
Showing 1-10 of 17 extracted citations

Micro-architectural Enhancements in Distributed Memory CGRAs for LU and QR Factorizations

2015 28th International Conference on VLSI Design • 2015
View 8 Excerpts
Highly Influenced

Hardware Architecture Based on Parallel Tiled QRD Algorithm for Future MIMO Systems

IEEE Transactions on Very Large Scale Integration (VLSI) Systems • 2017
View 1 Excerpt

Achieving Efficient QR Factorization by Algorithm-Architecture Co-design of Householder Transformation

2016 29th International Conference on VLSI Design and 2016 15th International Conference on Embedded Systems (VLSID) • 2016
View 2 Excerpts

References

Publications referenced by this paper.
Showing 1-10 of 20 references

Flexible Development of Dense Linear Algebra Algorithms on Massively Parallel Architectures with DPLASMA

2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum • 2011
View 7 Excerpts

Tiled QR factorization algorithms

2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC) • 2011
View 5 Excerpts

Computing the R of the QR factorization of tall and skinny matrices using MPI Reduce

J. Langou
arXiv, Tech. Rep. 1002.4250, 2010. • 2010
View 2 Excerpts

QR factorization of tall and skinny matrices in a grid computing environment

2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS) • 2010
View 4 Excerpts

Scalable Tile Communication-Avoiding QR Factorization on Multicore Cluster Systems

2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis • 2010
View 9 Excerpts

Tile QR factorization with parallel panel processing for multicore architectures

2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS) • 2010
View 4 Excerpts

Minimizing communication in sparse matrix solvers

Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis • 2009

Similar Papers

Loading similar papers…