Hidden factors and hidden topics: understanding rating dimensions with review text

@article{McAuley2013HiddenFA,
  title={Hidden factors and hidden topics: understanding rating dimensions with review text},
  author={Julian McAuley and Jure Leskovec},
  journal={Proceedings of the 7th ACM conference on Recommender systems},
  year={2013}
}
  • Julian McAuley, J. Leskovec
  • Published 12 October 2013
  • Computer Science
  • Proceedings of the 7th ACM conference on Recommender systems
In order to recommend products to users we must ultimately predict how a user will respond to a new product. To do so we must uncover the implicit tastes of each user as well as the properties of each product. For example, in order to predict whether a user will enjoy Harry Potter, it helps to identify that the book is about wizards, as well as the user's level of interest in wizardry. User feedback is required to discover these latent product and user dimensions. Such feedback often comes in… 

Figures and Tables from this paper

Review rating prediction using combined latent topics and associated sentiments: an empirical review
TLDR
An approach that performs review rating prediction task by using the latent topics extracted from reviews and their associated sentiments is proposed in this paper and shows that the proposed model works well with latent topics and sentiments when the underlying model is trained using a deep learning technique.
Rating Prediction with Topic Gradient Descent Method for Matrix Factorization in Recommendation
TLDR
A method of rating prediction which uses both the ratings and reviews, including a new first-order gradient method for matrix factorization, named Topic Gradient Descent (TGD).
A Tensor Factorization on Rating Prediction for Recommendation by Feature Extraction from Reviews
TLDR
This work proposes two rating prediction models TF-Doc2Vec and TF-LDA, an extended version of Matrix Factorization by adding another dimension, and considers seasonal context as the additional dimension.
Rating-Boosted Latent Topics: Understanding Users and Items with Ratings and Reviews
TLDR
Experimental results on 26 real-world datasets from Amazon demonstrate that the approach significantly improves the rating prediction accuracy compared with various state-of-the-art models, such as LFM, HFT, CTR and RMR models, and much higher improvement is achieved for users who have few ratings, which verifies the effectiveness of the proposed approach.
Integrating Topic and Latent Factors for Scalable Personalized Review-based Rating Prediction
TLDR
An integrated topic and latent factor model (ITLFM) is proposed, which combines topics and latent factors in a linear way to make them complement each other for better accuracies in rating prediction tasks.
Evaluating Vector Representations from User's Reviews in a Recommendation Task
TLDR
This work evaluates the use of text mining methods to extract and represent relevant information about user reviews, which were used alongside with rating data, as input of a content-based recommendation algorithm, and hypothesized that using the considered strategies, it is possible to create more meaningful and concise representations than the traditional bag-of-words model, and yet, increase the performance of recommendation systems.
Understanding Rating Behaviour and Predicting Ratings by Identifying Representative Users
TLDR
This study works with restaurant reviews to identify users whose ratings are influenced by dimensions like 'Service', 'Atmosphere' etc. in order to predict restaurant ratings and understand the variation in rating behaviour across different cuisines.
A Fine-Grained Latent Aspects Model for Recommendation: Combining Each Rating with Its Associated Review
TLDR
A Fine-grained Latent Aspects Model (FLAM) is proposed, which learns the topics for each review with the corresponding latent aspect ratings of the user and the item and can alleviate the cold-start problem.
A Tensor Factorization on Rating Prediction for Recommendation by Feature Extraction from Reviews
TLDR
This work proposes a framework TF+ for rating prediction models based on Tensor Factorization which is an extended version of Matrix Factorization (MF) by adding another dimension, seasonal context as the additional dimension, and conducts experimental comparisons.
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 36 REFERENCES
Learning Attitudes and Attributes from Multi-aspect Reviews
TLDR
This paper builds models for rating systems in which users leave separate ratings for each aspect of a product, and introduces new corpora consisting of five million reviews, rated with between three and six aspects.
Beyond the Stars: Improving Rating Predictions using Review Text Content
TLDR
This paper proposes new ad-hoc and regression-based recommendation measures, that both take into account the textual component of user reviews, and shows that using textual information results in better general or personalized review score predictions than those derived from the numerical star ratings given by the users.
Modeling online reviews with multi-grain topic models
TLDR
This paper presents a novel framework for extracting ratable aspects of objects from online user reviews and argues that multi-grain models are more appropriate for this task since standard models tend to produce topics that correspond to global properties of objects rather than aspects of an object that tend to be rated by a user.
Multi-Faceted Rating of Product Reviews
TLDR
This work tackles the problem of rating (i.e., attributing a numerical score of satisfaction to) consumer reviews based on their textual content by exploring several aspects of the problem, with special emphasis on how to generate vectorial representations of the text by means of POS tagging, sentiment analysis, and feature selection for ordinal regression learning.
ETF: extended tensor factorization model for personalizing prediction of review helpfulness
TLDR
This paper proposes a series of increasingly sophisticated probabilistic graphical models, based on Matrix Factorization and Tensor Factorization, and evaluates the proposed models using a database of 1.5 million reviews and more than 13 million quality ratings obtained from Epinions.com.
Rated aspect summarization of short comments
TLDR
The proposed methods are quite general and can be used to generate rated aspect summary automatically given any collection of short comments each associated with an overall rating.
From amateurs to connoisseurs: modeling the evolution of user expertise through online reviews
TLDR
A latent factor recommendation system that explicitly accounts for each user's level of experience is developed that leads to better recommendations, and allows the role of user experience and expertise on a novel dataset of fifteen million beer, wine, food, and movie reviews.
Latent aspect rating analysis on review text data: a rating regression approach
TLDR
Empirical experiments show that the proposed latent rating regression model can effectively solve the problem of LARA, and that the detailed analysis of opinions at the level of topical aspects enabled by the proposed model can support a wide range of application tasks, such as aspect opinion summarization, entity ranking based on aspect ratings, and analysis of reviewers rating behavior.
Finding a needle in a haystack of reviews: cold start context-based hotel recommender system
TLDR
A cold start hotel recommender system which uses the text of the reviews as its main data, and a novel weighted algorithm for text mining, which defines context groups based on reviews extracted from TripAdvisor.com and Venere.com.
On the design of LDA models for aspect-based opinion mining
TLDR
A set of design guidelines for aspect-based opinion mining is presented by discussing a series of increasingly sophisticated LDA models and arguing that these models represent the essence of the major published methods and allow us to distinguish the impact of various design decisions.
...
1
2
3
4
...