SentiWords: Deriving a High Precision and High Coverage Lexicon for Sentiment Analysis

@article{Gatti2016SentiWordsDA,
  title={SentiWords: Deriving a High Precision and High Coverage Lexicon for Sentiment Analysis},
  author={Lorenzo Gatti and Marco Guerini and Marco Turchi},
  journal={IEEE Transactions on Affective Computing},
  year={2016},
  volume={7},
  pages={409-421}
}
Deriving prior polarity lexica for sentiment analysis - where positive or negative scores are associated with words out of context - is a challenging task. Usually, a trade-off between precision and coverage is hard to find, and it depends on the methodology used to build the lexicon. Manually annotated lexica provide a high precision but lack in coverage, whereas automatic derivation from pre-existing knowledge guarantees high coverage at the cost of a lower precision. Since the automatic… 
Automatic construction of domain sentiment lexicon for semantic disambiguation
TLDR
An automatic method for the construction of the domain-specific sentiment lexicon (SDS-lex) to avoid sentimental ambiguity is proposed, which incorporates the sentiment information not only from the existing lexicons but also from the corpus by using the improved TF-IDF algorithm.
Neural domain adaptation of sentiment lexicons
TLDR
This paper proposes a neural network approach that modifies a sentiment lexicon using distantly annotated text of a certain domain and presents a completely data-driven domain characterization metric that measures the centrality of a set of documents.
Experimental Results on Customer Reviews Using Lexicon-Based Word Polarity Identification Method
TLDR
A new implementation of the lexicon-based word polarity identification method that operates by computing the semantic relatedness between the context expansion set of the target word and a synonym expansion set comprising the synonyms of all words surrounding the targetword within the original text fragment is introduced.
SentiDraw: Using star ratings of reviews to develop domain specific sentiment lexicon for polarity determination
TLDR
A novel approach ‘SentiDraw’ is proposed where the probability distribution of words across reviews with different star ratings is used to calculate their SO scores and it is shown that lexicons built with SentiDraw method delivers superior performance versus other lexicons in six out of nine cases.
Exploring automatic approaches for sentiment lexicon creation for Norwegian
TLDR
This work aims to fill some of the gap for sentiment analysis for Norwegian, by performing large-scale experiments on automatic methods for sentiment lexicon creation and expansion, using distributional models as well as resources containing lexical relations.
SentiStrength-SE: Exploiting domain specificity for improved sentiment analysis in software engineering text
TLDR
The empirical evaluations confirm that the domain specificity exploited in the SentiStrength-SE enables it to substantially outperform the existing domain-independent tools/toolkits in detecting sentiments in software engineering text.
DepecheMood++: a Bilingual Emotion Lexicon Built Through Simple Yet Powerful Techniques
TLDR
This work contributes an extension of an existing and widely used emotion lexicon for English and a novel version of the lexicon targeting Italian, and shows how simple techniques can be used to boost performances on datasets and tasks of varying degree of domain-specificity.
Sentiment Lexicon Construction With Hierarchical Supervision Topic Model
TLDR
A novel hierarchical supervision topic model to construct a topic-adaptive sentiment lexicon (TaSL) for higher-level classification tasks and performs better than the existing manual sentiment Lexicon (MPQA), the topic model based domain-specific lexicon(ssLDA), the expanded lexicons(Weka-ED, Weka-STS, NRC, Liu's), and deep neural network based lexicons (nnLexicon, HIT, HSSWE).
A Comparative Study of Polarity Lexicons to Identify Extreme Opinions
TLDR
This paper compares a method to automatically build a sentiment lexicon, with four well-known sentiment lexicons, and a set of textual features is integrated into the classifiers so as to analyze how these textual features improve the lexicon performance.
Automatic Construction of Domain-Specific Sentiment Lexicons for Polarity Classification
TLDR
A strategy to build sentiment lexicons (positive and negative words) from corpora and shows that the lexicon generated automatically outperforms other manual lexicons when they are used as features of a supervised sentiment classifier.
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 78 REFERENCES
Sentiment Analysis: How to Derive Prior Polarities from SentiWordNet
TLDR
This paper compares the most often used techniques together with newly proposed ones and incorporate all of them in a learning framework to see whether blending them can further improve the estimation of prior polarity scores.
SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining
TLDR
This work discusses SENTIWORDNET 3.0, a lexical resource explicitly devised for supporting sentiment classification and opinion mining applications, and reports on the improvements concerning aspect (b) that it embodies with respect to version 1.0.
SentiFul: A Lexicon for Sentiment Analysis
TLDR
The algorithm for automatic extraction of new sentiment-related compounds from WordNet is elaborated using words from SentiFul as seeds for sentiment-carrying base components and applying the patterns of compound formations.
Are SentiWordNet scores suited for multi-domain sentiment classification?
  • K. Denecke
  • Computer Science
    2009 Fourth International Conference on Digital Information Management
  • 2009
TLDR
The results show that for cross-domain sentiment analysis rule-based approaches with fix opinion lexica are unsuited and for machine-learning based sentiment classification a mixture of documents of different domains achieves good results.
SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining
TLDR
SENTIWORDNET is a lexical resource in which each WORDNET synset is associated to three numerical scores Obj, Pos and Neg, describing how objective, positive, and negative the terms contained in the synset are.
Lexicon-Based Methods for Sentiment Analysis
TLDR
The Semantic Orientation CALculator (SO-CAL) uses dictionaries of words annotated with their semantic orientation (polarity and strength), and incorporates intensification and negation, and is applied to the polarity classification task.
Sentiment Lexicon Creation from Lexical Resources
TLDR
A corpus-based evaluation of several automated methods for creating lexicons, exploiting vast lexical resources, and considers propagating the sentiment of a seed set of words through semantic relations or through PageRank-based similarities, which turns out to outperform the others.
Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank
TLDR
A Sentiment Treebank that includes fine grained sentiment labels for 215,154 phrases in the parse trees of 11,855 sentences and presents new challenges for sentiment compositionality, and introduces the Recursive Neural Tensor Network.
Depeche Mood: a Lexicon for Emotion Analysis from Crowd Annotated News
TLDR
This paper presents a novel approach for extracting - in a totally automated way - a high-coverage and high-precision lexicon of roughly 37 thousand terms annotated with emotion scores, called DepecheMood, which exploits in an original way 'crowd-sourced' affective annotation implicitly provided by readers of news articles from rappler.com.
UPAR7: A knowledge-based system for headline sentiment tagging
TLDR
The results show that working with linguistic techniques and a broad-coverage lexicon is a viable approach to sentiment analysis of headlines, and a knowledge-based system achieves high accuracy on emotion and valence annotation.
...
1
2
3
4
5
...