# Matrix Factorization on GPUs with Memory Optimization and Approximate Computing

@article{Tan2018MatrixFO, title={Matrix Factorization on GPUs with Memory Optimization and Approximate Computing}, author={W. Tan and S. Chang and L. Fong and Cheng Li and Zijun Wang and L. Cao}, journal={Proceedings of the 47th International Conference on Parallel Processing}, year={2018} }

Matrix factorization (MF) discovers latent features from observations, which has shown great promises in the fields of collaborative filtering, data compression, feature extraction, word embedding, etc. [...] Key Method The former exploits GPU memory hierarchy to increase data reuse, while the later reduces unneccessary computing without hurting the convergence of learning algorithms. Extensive experiments on large-scale datasets show that our solution not only outperforms the competing CPU solutions by a large…Expand Abstract

#### Supplemental Code

GITHUB REPO

Via Papers with Code

CUDA Matrix Factorization Library with Alternating Least Square (ALS)

6 Citations

Block based Singular Value Decomposition approach to matrix factorization for recommender systems

- Mathematics, Computer Science
- 2019

- 4
- PDF

BMF: Block matrix approach to factorization of large scale data

- Mathematics, Computer Science
- 2019

- 1
- PDF

Identifying and (automatically) remedying performance problems in CPU/GPU applications

- Computer Science
- 2020

Diogenes: looking for an honest CPU/GPU performance measurement tool

- Computer Science
- 2019

- 2
- Highly Influenced

#### References

##### Publications referenced by this paper.

SHOWING 1-10 OF 20 REFERENCES

A fast parallel SGD for matrix factorization in shared memory systems

- Computer Science
- 2013

- 188
- Highly Influential
- PDF

Large-scale matrix factorization with distributed stochastic gradient descent

- Computer Science
- 2011

- 597
- Highly Influential
- PDF

Scalable Coordinate Descent Approaches to Parallel Matrix Factorization for Recommender Systems

- Computer Science
- 2012

- 248
- Highly Influential
- PDF

Accelerating collaborative filtering using concepts from high performance computing

- Computer Science
- 2015

- 31
- Highly Influential
- PDF

Hogwild: A Lock-Free Approach to Parallelizing Stochastic Gradient Descent

- Mathematics, Computer Science
- 2011

- 1,686
- Highly Influential
- PDF

Factorbird - a Parameter Server Approach to Distributed Matrix Factorization

- Computer Science
- 2014

- 27
- Highly Influential
- PDF

Exploiting Bounded Staleness to Speed Up Big Data Analytics

- Computer Science
- 2014

- 114
- Highly Influential
- PDF