Content-based recommendations with Poisson factorization

Abstract

We develop collaborative topic Poisson factorization (CTPF), a generative model of articles and reader preferences. CTPF can be used to build recommender systems by learning from reader histories and content to recommend personalized articles of interest. In detail, CTPF models both reader behavior and article texts with Poisson distributions, connecting the latent topics that represent the texts with the latent preferences that represent the readers. This provides better recommendations than competing methods and gives an interpretable latent space for understanding patterns of readership. Further, we exploit stochastic variational inference to model massive real-world datasets. For example, we can fit CPTF to the full arXiv usage dataset, which contains over 43 million ratings and 42 million word counts, within a day. We demonstrate empirically that our model outperforms several baselines, including the previous state-of-the art approach.

Extracted Key Phrases

5 Figures and Tables

0102030201520162017
Citations per Year

Citation Velocity: 17

Averaging 17 citations per year over the last 3 years.

Learn more about how we calculate this metric in our FAQ.

Cite this paper

@inproceedings{Gopalan2014ContentbasedRW, title={Content-based recommendations with Poisson factorization}, author={Prem Gopalan and Laurent Charlin and David M. Blei}, booktitle={NIPS}, year={2014} }