Bayesian Low-Rank Interpolative Decomposition for Complex Datasets

@article{Lu2022BayesianLI,
  title={Bayesian Low-Rank Interpolative Decomposition for Complex Datasets},
  author={Jun Lu},
  journal={ArXiv},
  year={2022},
  volume={abs/2205.14825}
}
  • Jun Lu
  • Published 30 May 2022
  • Computer Science
  • ArXiv
In this paper, we introduce a probabilistic model for learning interpolative decomposition (ID), which is commonly used for feature selection, low-rank approximation, and identifying hidden patterns in data, where the matrix factors are latent variables associated with each data dimension. Prior densities with support on the specified subspace are used to address the constraint for the magnitude of the factored component of the observed matrix. Bayesian inference procedure based on Gibbs… 

Figures and Tables from this paper

Robust Bayesian Nonnegative Matrix Factorization with Implicit Regularizers

TLDR
A probabilistic model with implicit norm regularization for learning nonnegative matrix factorization (NMF) that is commonly used for predicting missing values and hidden patterns in the data is introduced and proposed Bayesian NMF GL 22 and GL 22, ∞ models lead to robust predictions for different data values and avoid over-prediction compared with competitive BayesianNMF approaches.

Comparative Study of Inference Methods for Interpolative Decomposition

We propose a probabilistic model with automatic relevance determination (ARD) for computing interpolative decomposition (ID), that is commonly used for low-rank approximation, feature selection, and

References

SHOWING 1-10 OF 29 REFERENCES

Flexible and Hierarchical Prior for Bayesian Nonnegative Matrix Factorization

TLDR
A probabilistic model for learning nonnegative matrix factorization (NMF) that is commonly used for predicting missing values and hidden patterns in the data, in which the matrix factors are latent variables as-sociated with each data dimension is introduced.

Matrix Decomposition and Applications

TLDR
The sole aim of this survey is to give a self-contained introduction to concepts and mathematical tools in numerical linear algebra and matrix analysis in order to seamlessly introduce matrix decomposition techniques and their applications in subsequent sections.

A survey on Bayesian inference for Gaussian mixture model

  • Jun Lu
  • Computer Science
    ArXiv
  • 2021
TLDR
This survey is primarily a summary of purpose, significance of important background and techniques for Gaussian mixture model, e.g., Dirichlet prior, Chinese restaurant process, and most importantly the origin and complexity of the methods which shed light on their modern applications.

Numerical Matrix Decomposition and its Modern Applications: A Rigorous First Course

  • Jun Lu
  • Computer Science
    ArXiv
  • 2021
TLDR
The sole aim of this survey is to give a self-contained introduction to concepts and mathematical tools in numerical linear algebra and matrix analysis in order to seamlessly introduce matrix decomposition techniques and their applications in subsequent sections.

Efficient Algorithms for Constructing an Interpolative Decomposition

TLDR
This work introduces two optimized algorithms to construct an interpolative decomposition along with numerical evidence that they outperform the current state of the art.

A Survey of Methods for Low-Power Deep Learning and Computer Vision

TLDR
This paper surveys the progress of low-power deep learning and computer vision, specifically in regards to inference, and discusses the methods for compacting and accelerating DNN models.

Prior and Likelihood Choices for Bayesian Matrix Factorisation on Small Datasets

TLDR
In this paper, several insights are offered into the trade-offs between prior and likelihood choices for Bayesian matrix factorisation on small datasets - such as that Poisson models give poor predictions, and that nonnegative models are more constrained than real-valued ones.

Harnessing Connectivity in a Large-Scale Small-Molecule Sensitivity Dataset.

TLDR
An annotated cluster multidimensional enrichment analysis is developed to explore the associations between groups of small molecules and groups of CCLs in a new, quantitative sensitivity dataset and reveals insights into small-molecule mechanisms of action, and genomic features that associate with CCL response to small- molecule treatment.

Non-negative matrix factorization by maximizing correntropy for cancer clustering

TLDR
The proposed NMF-MCC method is significantly more accurate than the state-of-the-art methods in cancer clustering and maximizes the correntropy between the product of the two matrices and the original matrix.

The MovieLens Datasets: History and Context

TLDR
The history of MovieLens and the MovieLens datasets is documents, including a discussion of lessons learned from running a long-standing, live research platform from the perspective of a research organization, and best practices and limitations of using the Movie Lens datasets in new research are documented.