Modeling and Predicting the Helpfulness of Online Reviews

  title={Modeling and Predicting the Helpfulness of Online Reviews},
  author={Yang Liu and Xiangji Huang and Aijun An and Xiaohui Yu},
  journal={2008 Eighth IEEE International Conference on Data Mining},
Online reviews provide a valuable resource for potential customers to make purchase decisions. However, the sheer volume of available reviews as well as the large variations in the review quality present a big impediment to the effective use of the reviews, as the most helpful reviews may be buried in the large amount of low quality reviews. The goal of this paper is to develop models and algorithms for predicting the helpfulness of reviews, which provides the basis for discovering the most… 

Figures and Tables from this paper

A quality-aware model for sales prediction using reviews

This paper proposes a regression model that explicitly takes into account the quality factor, and discusses how this quality information can be predicted when it is not readily available, and results on a movie review dataset confirm the effectiveness of the proposed model.

Predicting the helpfulness of online customer reviews: The role of title features

The proposed prediction model is examined on two real-life Amazon datasets using machine learning techniques and the results show a promising performance, however, feature importance analysis reveals the low importance of title features in the predictive model.

Exploring the influential reviewer, review and product determinants for review helpfulness

Findings show that reviews which have large number of comments, large values of sentiment and polarity scores receive more helpful votes, and the reviewer activity length and recency are statistically significant predictors for helpfulness prediction.

Context-aware review helpfulness rating prediction

This paper identifies various types of context information, model them mathematically, and propose a context-aware review helpfulness rating prediction framework CAP, which demonstrates the effectiveness of the proposed framework and the importance of context awareness in solving the review helpfuls rating prediction problem.

From Helpfulness Prediction to Helpful Review Retrieval for Online Product Reviews

This paper defines a helpful review retrieval task to support the customers by returning a ranked list of helpful reviews according to their helpfulness about the product of their interest and proposes a method with an enhanced list of features for review representation and a multiple linear regression model using the elastic net regularization method.

Predicting the helpfulness of online product reviews: A multilingual approach

Predicting Helpfulness of Crowd-Sourced Reviews: A Survey

This paper aims to review the existing literature on review helpfulness prediction, to identify data sources, ML techniques and potential challenges, and to give a quick overview of state-of-the-art techniques and challenges.

Aspect-Based Helpfulness Prediction for Online Product Reviews

This paper proposes an aspect extraction model making use of product category information to balance the aspects of a general category and those of subcategories under it and shows that it can improve helpfulness prediction by 7% than the baseline on 5 popular product categories from

Helpfulness Prediction of Online Product Reviews

This paper proposes a feature extraction technique that can quantify and measure helpfulness for each product based on user submitted reviews that can be applied to the problem of electronic product review accumulation using helpfulness prediction.

Linguistic features for review helpfulness prediction




Designing novel review ranking systems: predicting the usefulness and impact of reviews

It is shown that subjectivity analysis can give useful clues about the helpfulness of a review and about its impact on sales and the results can have several implications for the market design of online opinion forums.

Movie review mining and summarization

A multi-knowledge based approach is proposed, which integrates WordNet, statistical analysis and movie knowledge, and the experimental results show the effectiveness of the proposed approach in movie review mining and summarization.

Mining and summarizing customer reviews

This research aims to mine and to summarize all the customer reviews of a product, and proposes several novel techniques to perform these tasks.

Exploring the Value of Online Reviews to Organizations: Implications for Revenue Forecasting and Planning

Evidence is provided for the claim that online ratings can be considered as a useful proxy for word-of-mouth about movies and for the value of publicly available online forum information to firms for real-time forecasting and competitive analysis.

ARSA: a sentiment-aware model for predicting sales performance using blogs

ARSA is presented, an autoregressive sentiment-aware model, to utilize the sentiment information captured by S-PLSA for predicting product sales performance and is compared with alternative models that do not take into account the sentiment Information.

Opinion spam and analysis

It is shown that opinion spam is quite different from Web spam and email spam, and thus requires different detection techniques, and therefore requires some novel techniques to detect them.

Mining the peanut gallery: opinion extraction and semantic classification of product reviews

This work develops a method for automatically distinguishing between positive and negative reviews and draws on information retrieval techniques for feature extraction and scoring, and the results for various metrics and heuristics vary depending on the testing situation.

Show me the money!: deriving the pricing power of product features by mining consumer reviews

A novel hybrid technique combining text mining and econometrics that models consumer product reviews as elements in a Tensor product of feature and evaluation spaces is developed and impute the quantitative impact of consumer reviews on product demand as a linear functional from this tensor product space.

Identifying opinion leaders in the blogosphere

The InfluenceRank algorithm ranks blogs according to not only how important they are as compared to other blogs, but also how novel the information they can contribute to the network.

Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews

A simple unsupervised learning algorithm for classifying reviews as recommended (thumbs up) or not recommended (Thumbs down) if the average semantic orientation of its phrases is positive.