Semi-Supervised Recursive Autoencoders for Predicting Sentiment Distributions

Abstract

We introduce a novel machine learning framework based on recursive autoencoders for sentence-level prediction of sentiment label distributions. Our method learns vector space representations for multi-word phrases. In sentiment prediction tasks these representations outperform other state-of-the-art approaches on commonly used datasets, such as movie reviews, without using any pre-defined sentiment lexica or polarity shifting rules. We also evaluate the model’s ability to predict sentiment distributions on a new dataset based on confessions from the experience project. The dataset consists of personal user stories annotated with multiple labels which, when aggregated, form a multinomial distribution that captures emotional reactions. Our algorithm can more accurately predict distributions over such labels compared to several competitive baselines.

Extracted Key Phrases

10 Figures and Tables

0501001502011201220132014201520162017
Citations per Year

649 Citations

Semantic Scholar estimates that this publication has 649 citations based on the available data.

See our FAQ for additional information.

Cite this paper

@inproceedings{Socher2011SemiSupervisedRA, title={Semi-Supervised Recursive Autoencoders for Predicting Sentiment Distributions}, author={Richard Socher and Jeffrey Pennington and Eric H. Huang and Andrew Y. Ng and Christopher D. Manning}, booktitle={EMNLP}, year={2011} }