Combining Sentiment Lexica with a Multi-View Variational Autoencoder

  title={Combining Sentiment Lexica with a Multi-View Variational Autoencoder},
  author={Alexander Miserlis Hoyle and Lawrence Wolf-Sonkin and Hanna M. Wallach and Ryan Cotterell and Isabelle Augenstein},
When assigning quantitative labels to a dataset, different methodologies may rely on different scales. In particular, when assigning polarities to words in a sentiment lexicon, annotators may use binary, categorical, or continuous labels. Naturally, it is of interest to unify these labels from disparate scales to both achieve maximal coverage over words and to create a single, more robust sentiment lexicon while retaining scale coherence. We introduce a generative model of sentiment lexica to… 
Quantifying Gender Bias Towards Politicians in Cross-Lingual Language Models
A simple method to probe pre-trained language models for gender bias, which is used to effect a multi-lingual study of gender bias towards politicians, and suggests that larger language models do not tend to be significantly more gender-biased than smaller ones.
Unsupervised Discovery of Gendered Language through Latent-Variable Modeling
A generative latent-variable model is introduced that jointly represents adjective (or verb) choice, with its sentiment, given the natural gender of a head (or dependent) noun.
Joint Emotion Label Space Modelling for Affect Lexica
The overall findings are that emotion lexica can offer complementary information to even extremely large pre-trained models such as BERT, and the performance of the models is comparable to state-of-the art models that are specifically engineered for certain datasets, and even outperform the state of the art on four datasets.
A Unified Feature Representation for Lexical Connotations
A method for creating lexical representations that capture connotations within the embedding space is presented and it is shown that using the embeddings provides a statistically significant improvement on the task of stance detection when data is limited.


SentiMerge: Combining Sentiment Lexicons in a Bayesian Framework
This paper introduces a Bayesian probabilistic model, which can simultaneously combine polarity scores from several data sources and estimate the quality of each source, and applies this algorithm to a set of four German sentiment lexicons, to produce the SentiMerge lexicon, which is made publically available.
Learning Word Vectors for Sentiment Analysis
This work presents a model that uses a mix of unsupervised and supervised techniques to learn word vectors capturing semantic term--document information as well as rich sentiment content, and finds it out-performs several previously introduced methods for sentiment classification.
Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification
This work extends to sentiment classification the recently-proposed structural correspondence learning (SCL) algorithm, reducing the relative error due to adaptation between domains by an average of 30% over the original SCL algorithm and 46% over a supervised baseline.
VADER: A Parsimonious Rule-Based Model for Sentiment Analysis of Social Media Text
Interestingly, using the authors' parsimonious rule-based model to assess the sentiment of tweets, it is found that VADER outperforms individual human raters, and generalizes more favorably across contexts than any of their benchmarks.
Targeted Aspect-Based Sentiment Analysis via Embedding Commonsense Knowledge into an Attentive LSTM
A novel solution to targeted aspect-based sentiment analysis, which tackles the challenges of both aspect- based sentiment analysis and targeted sentiment analysis by exploiting commonsense knowledge by augmenting the LSTM network with a hierarchical attention mechanism.
Tweester at SemEval-2016 Task 4: Sentiment Analysis in Twitter Using Semantic-Affective Model Adaptation
This system comprises of multiple independent models such as neural networks, semantic-affective models and topic modeling combined in a probabilistic way to predict a tweet’s sentiment and a late fusion scheme is adopted for the final decision.
Combining Sentiment Lexicons of Arabic Terms
This paper used the method to normalize and unify lexicon items and merge duplicated lexicon Items from twelve lexicons for (in)formal Arabic to result in a coherent Arabic sentiment lexicon with the largest number of terms.
SemEval-2017 Task 4: Sentiment Analysis in Twitter
The fourth year of the SemEval-2016 Task 4 comprises five subtasks, three of which represent a significant departure from previous editions, and the task continues to be very popular, attracting a total of 43 teams.
Sentiment Analysis of Short Informal Texts
We describe a state-of-the-art sentiment analysis system that detects (a) the sentiment of short informal textual messages such as tweets and SMS (message-level task) and (b) the sentiment of a word
SenticNet 3: A Common and Common-Sense Knowledge Base for Cognition-Driven Sentiment Analysis
SenticNet 3 models nuanced semantics and sentics (that is, the conceptual and affective information associated with multi-word natural language expressions), representing information with a symbolic opacity of an intermediate nature between that of neural networks and typical symbolic systems.