Mining and summarizing customer reviews

  title={Mining and summarizing customer reviews},
  author={Minqing Hu and Bing Liu},
  journal={Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining},
  • Minqing HuBing Liu
  • Published 22 August 2004
  • Business
  • Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Merchants selling products on the Web often ask their customers to review the products that they have purchased and the associated services. [] Key Method Our task is performed in three steps: (1) mining product features that have been commented on by customers; (2) identifying opinion sentences in each review and deciding whether each opinion sentence is positive or negative; (3) summarizing the results. This paper proposes several novel techniques to perform these tasks. Our experimental results using…

Aspect Opinion Mining on Customer Reviews

e-commerce is becoming more and more popular and interactive, Merchants selling products on the Web 2.0 often ask their customers to review the products which they have purchased. The number of


It is a common practice that merchants selling products on the Web ask their customers to review the products and associated services. As ecommerce is becoming more and more popular, the number of

1 An Overview of Sentiment Analysis

This paper focuses on the frequently used techniques for implementing sentiment analysis of the online product reviews and Machine Learning and Lexicon based approach.

A Survey on Detection of Reviews Using Sentiment Classification of Methods

Various techniques to classify opinion as positive or negative and also detection of reviews as spam or non-spam are surveyed.

Summarizing customer reviews based on product features

This paper combines the LDA model and the association rules to extract the product features and the corresponding sentiment words of a product, and uses cross-validation to prune the extract result.

Mining millions of reviews: a technique to rank products based on importance of reviews

A product ranking model that applies weights to product review factors to calculate a products ranking score is presented which produces product ranking results that closely relate to the products sales ranking as reported by the retailer.

Summarization of customer reviews for a product on a website using natural language processing

HELP-ME-BUY APP is an android application that assists buyers in online shopping by analyzing and summarizing all reviews which will help the user decide what other buyers have experienced on buying this product.

Feature Extraction and Opinion Mining in Online Product Reviews

A system, which automatically extracts the product features from the reviews and determines if they have been expressed in a positive or a negative way by the reviewers and a supervised machine learning algorithm based polarity classifier that determines the sentiment of the review sentences with respect to the prominent features is described.


The merchants Advertisement Products in the web ask their customers to check the products and related services. As e-business is becoming more fashionable, the number of voters that a product

Leveraging User Opinions for Product Catalog Enrichment

This thesis formulated and investigated the following problem: enriching product catalogs with user opinions at the attribute granularity level as a new form of opinion summarization and developed two distinct methods.



Mining Opinion Features in Customer Reviews

This project aims to summarize all the customer reviews of a product by mining opinion/product features that the reviewers have commented on and a number of techniques are presented to mine such features.

Mining product reputations on the Web

A new framework for mining product reputations on the Internet is presented, which offers a drastic reduction in the overall cost of reputation analysis over that of conventional survey approaches and supports the discovery of knowledge from the pool of opinions on the web.

Learning to classify documents according to genre

The idea of domain transfer—genre classifiers should be reusable across multiple topics—which does not arise in standard text classification is introduced and different features for building genre classifiers and their ability to transfer across multiple-topic domains are investigated.

Summarizing text documents: sentence selection and evaluation metrics

An analysis of news-article summaries generated by sentence selection, using a normalized version of precision-recall curves with a baseline of random sentence selection to evaluate features and empirical results show the importance of corpus-dependent baseline summarization standards, compression ratios and carefully crafted long queries.

Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews

A simple unsupervised learning algorithm for classifying reviews as recommended (thumbs up) or not recommended (Thumbs down) if the average semantic orientation of its phrases is positive.

On the Computation of Point of View

The focus of the current research is to build a less gullible story understander by encoding in it a means to recognize point of view, not only for information retrieval tasks which demand a search for credible stories, but also in future entertainment technologies which will be capable of finding and then assembling together into a unified presentation.

Integrating Classification and Association Rule Mining

The integration is done by focusing on mining a special subset of association rules, called class association rules (CARs), and shows that the classifier built this way is more accurate than that produced by the state-of-the-art classification system C4.5.

Thumbs up? Sentiment Classification using Machine Learning Techniques

This work considers the problem of classifying documents not by topic, but by overall sentiment, e.g., determining whether a review is positive or negative, and concludes by examining factors that make the sentiment classification problem more challenging.

Combining Low-Level and Summary Representations of Opinions for Multi-Perspective Question Answering

This paper proposes an approach to multi-perspective question answering that views the task as one of opinion-oriented information extraction, and describes an annotation scheme developed for the low-level representation of opinions.

Technical terminology: some linguistic properties and an algorithm for identification in text

This paper identifies some linguistic properties of technical terminology, and uses them to formulate an algorithm for identifying technical terms in running text, and presents a terminology indentification algorithm that is motivated by these linguistic properties.