Lexicon-Based Methods for Sentiment Analysis

  title={Lexicon-Based Methods for Sentiment Analysis},
  author={Maite Taboada and Julian Brooke and Milan Tofiloski and Kimberly D. Voll and Manfred Stede},
  journal={Computational Linguistics},
We present a lexicon-based approach to extracting sentiment from text. The Semantic Orientation CALculator (SO-CAL) uses dictionaries of words annotated with their semantic orientation (polarity and strength), and incorporates intensification and negation. SO-CAL is applied to the polarity classification task, the process of assigning a positive or negative label to a text that captures the text's opinion towards its main subject matter. We show that SO-CAL's performance is consistent across… 

Explaining Sentiment from Lexicon

A detailed set of linguistic rules that allow to understand the text fragments which are semantically linked to a given concept of interest in a text, which are designed in the spirit of the recent Interpretable AI trend.

Unsupervised Word Polarity Tagging by Exploiting Continuous Word Representations

This work explores the possibility of automatically generating domain adapted polarity lexicons employing continuous word representations, in particular the popular tool Word2Vec, and shows a qualitative evaluation of a small set of words.

Automatic Construction of Domain-Specific Sentiment Lexicons for Polarity Classification

A strategy to build sentiment lexicons (positive and negative words) from corpora and shows that the lexicon generated automatically outperforms other manual lexicons when they are used as features of a supervised sentiment classifier.

Sentiment Analysis based on Specific Dictionary and Sentence Analysis

This work proposes and evaluates new features to be used in a word polarity based approach to sentiment classification, and measures the performance of the sentiment analysis engine using the domainadapted lexicon on a large subset of the TripAdvisor database.

Sentiment analysis for reviews and microtexts based on lexico-syntactic knowledge

Experimental results show the utility of employing syntactic information to classify the polarity in both types of texts and the importance of defining mechanisms to adapt the system for a specific domain and social medium.

Semantic Orientation-Based Approach for Sentiment Analysis

This chapter explores the corpus-based semantic orientation approach for sentiment analysis, which initially mines sentiment-bearing terms from the unstructured text and further computes the polarity of the terms.

Adaptation and Use of Subjectivity Lexicons for Domain Dependent Sentiment Classification

This work studies the effect of subjectivity-based features on sentiment classification on two lexicons and proposes new subjectivity -based features for sentiment classification based on the occurrence of subjective words in review texts.

Analysing domain suitability of a sentiment lexicon by identifying distributionally bipolar words

A method to assess suitability of generic sentiment lexicons for a given domain, namely to identify frequent bigrams where a polar word switches polarity, and can be further used to quantify ambiguous words.

CT-SPA: Text sentiment polarity prediction model using semi-automatically expanded sentiment lexicon

The proposed method uses two sentiment dictionaries from different sources: the Chinese sentiment dictionary CSWN that integrates Chinese WordNet with SentiWordNet, and the sentiment dictionary obtained from a training corpus labeled with sentiment polarities.



Review Sentiment Scoring via a Parse-and-Paraphrase Paradigm

An approach to extracting adverb-adjective-noun phrases based on clause structure obtained by parsing sentences into a hierarchical representation is proposed and a robust general solution for modeling the contribution of adverbials and negation to the score for degree of sentiment is proposed.

More than Words: Syntactic Packaging and Implicit Sentiment

A strong predictive connection between linguistically well motivated features and implicit sentiment is established, and it is shown how computational approximations of these features can be used to improve on existing state-of-the-art sentiment classification results.

Cross-Linguistic Sentiment Analysis: From English to Spanish

The results indicate that, although languageindependent methods provide a decent baseline performance, there is also a significant cost to automation, and thus the best path to long-term improvement is through the inclusion of language-specific knowledge and resources.

Using appraisal groups for sentiment analysis

A new method for sentiment classification based on extracting and analyzing appraisal groups such as ``very good'' or ``not terribly funny'' is presented, based on several task-independent semantic taxonomies based on Appraisal Theory.

Generating High-Coverage Semantic Orientation Lexicons From Overtly Marked Words and a Thesaurus

This work proposes a simple approach to generate a high-coverage semantic orientation lexicon, which includes both individual words and multi-word expressions, using only a Roget-like thesaurus and a handful of affixes and has properties that support the Polyanna Hypothesis.

A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts

A novel machine-learning method is proposed that applies text-categorization techniques to just the subjective portions of the document, which greatly facilitates incorporation of cross-sentence contextual constraints.

Learning Subjective Adjectives from Corpora

This paper identifies strong clues of subjectivity using the results of a method for clustering words according to distributional similarity (Lin 1998), seeded by a small amount of detailed manual annotation.

Mining WordNet for a Fuzzy Sentiment: Sentiment Tag Extraction from WordNet Glosses

Net Overlap Score can be used as a measure of the words degree of membership in the fuzzy category of sentiment: the core adjectives were identified most accurately both by STEP and by human annotators, while the words on the periphery of the category had the lowest scores and were associated with low rates of inter-annotator agreement.

What's great and what's not: learning to classify the scope of negation for improved sentiment analysis

This paper presents a negation detection system based on a conditional random field modeled using features from an English dependency parser, and a new negation corpus is presented for the domain of English product reviews obtained from the open web.

Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis

A new approach to phrase-level sentiment analysis is presented that first determines whether an expression is neutral or polar and then disambiguates the polarity of the polar expressions.