SentiBench - a benchmark comparison of state-of-the-practice sentiment analysis methods

@article{Ribeiro2016SentiBenchA,
  title={SentiBench - a benchmark comparison of state-of-the-practice sentiment analysis methods},
  author={Filipe Nunes Ribeiro and Matheus Ara{\'u}jo and Pollyanna Gonçalves and Marcos Andr{\'e} Gonçalves and Fabr{\'i}cio Benevenuto},
  journal={EPJ Data Science},
  year={2016},
  volume={5},
  pages={1-29}
}
In the last few years thousands of scientific papers have investigated sentiment analysis, several startups that measure opinions on real data have emerged and a number of innovative products related to this theme have been developed. There are multiple methods for measuring sentiments, including lexical-based and supervised machine learning methods. Despite the vast interest on the theme and wide popularity of some methods, it is unclear which one is better for identifying the polarity (i.e… 
10SENT: A stable sentiment analysis method based on the combination of off‐the‐shelf approaches
TLDR
The experimental results demonstrate that the combined method (aka, 10SENT) improves the effectiveness of the classification task, and tackles the key problem of cross‐domain low stability and produces the best results in almost all considered contexts, without any additional costs.
On the combination of "off-the-shelf" sentiment analysis methods
TLDR
This paper proposes ensemble combination methods aimed at combining the outputs of several state-of-the-art proposals in order to maximize both goals, which sometimes can be conflicting.
iFeel 2.0: A Multilingual Benchmarking System for Sentence-Level Sentiment Analysis
TLDR
iFeel 2.0 is proposed, an online web system that implements 19 sentence-level sentiment analysis methods and allows users to easily label a dataset with all of them and can also be helpful for those interested in using sentiment analysis.
An evaluation of machine translation for multilingual sentence-level sentiment analysis
TLDR
Evaluating existing efforts proposed to do language specific sentiment analysis for English suggests that simply translating the input text on a specific language to English and then using one of the existing English methods can be better than the existing language specific efforts evaluated.
Investigating the Lack of Consensus Among Sentiment Analysis Tools
TLDR
Seven of the most well-regarded sentiment analysis tools are compared, and it is concluded that none of them is sufficiently reliable to be used on its own and combining them and relying on their results only when various tools reach an agreement seems to be a better option.
Commercial Sentiment Analysis Solutions: A Comparative Study
TLDR
This study shows that an independent, critical experimental analysis of sentiment analysis services can provide interesting insights into their general reliability and particular classification accuracy beyond marketing claims to critically compare solutions based on real-world data and analyze potential weaknesses and margins of error before making an investment.
Twitter as a Source for Time- and Domain-Dependent Sentiment Lexicons
TLDR
To complement traditional sentiment dictionaries, this work presents a system for lexicon expansion that extracts the most relevant terms from news and assesses their positive or negative score through Twitter, and shows that complementary lexicons increase the performance of three state-of-the-art sentiment systems.
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 100 REFERENCES
Comparing and combining sentiment analysis methods
TLDR
A new method that combines existing approaches, providing the best coverage results and competitive agreement is developed and a free Web service called iFeel is presented, which provides an open API for accessing and comparing results across different sentiment methods for a given text.
iFeel 2.0: A Multilingual Benchmarking System for Sentence-Level Sentiment Analysis
TLDR
iFeel 2.0 is proposed, an online web system that implements 19 sentence-level sentiment analysis methods and allows users to easily label a dataset with all of them and can also be helpful for those interested in using sentiment analysis.
VADER: A Parsimonious Rule-Based Model for Sentiment Analysis of Social Media Text
TLDR
Interestingly, using the authors' parsimonious rule-based model to assess the sentiment of tweets, it is found that VADER outperforms individual human raters, and generalizes more favorably across contexts than any of their benchmarks.
iFeel: a system that compares and combines sentiment analysis methods
TLDR
iFeel is a Web application that allows one to detect sentiments in any form of text including unstructured social media data and provides a single platform to compare the strengths and weaknesses of various sentiment analysis methods with a user friendly interface.
Sentiment Analysis and Opinion Mining
  • Lei Zhang, B. Liu
  • Computer Science
    Encyclopedia of Machine Learning and Data Mining
  • 2017
TLDR
This book is a comprehensive introductory and survey text that covers all important topics and the latest developments in the field with over 400 references and is suitable for students, researchers and practitioners who are interested in social media analysis in general and sentiment analysis in particular.
SemEval-2013 Task 2: Sentiment Analysis in Twitter
TLDR
Crowdourcing on Amazon Mechanical Turk was used to label a large Twitter training dataset along with additional test sets of Twitter and SMS messages for both subtasks, which included two subtasks: A, an expression-level subtask, and B, a message level subtask.
Language-Independent Twitter Sentiment Analysis
TLDR
The experiments show that the classification approach can be applied effectively for multiple languages without requiring extra effort per additional language, and the sentiment evaluation dataset publicly available.
Umigon: sentiment analysis for tweets based on terms lists and heuristics
TLDR
The “sentiment detection engine” of Umigon consists of 4 main parts, which are detailed below: detection of semantic features in the entire tweet, evaluation of hashtags, global heuristics and onomatopes.
A New ANEW: Evaluation of a Word List for Sentiment Analysis in Microblogs
TLDR
This work wanted to examine how well ANEW and other word lists performs for the detection of sentiment strength in microblog posts in comparison with a new word list specifically constructed for microblogs.
Distinguishing the Popularity between Topics: A System for Up-to-Date Opinion Retrieval and Mining in the Web
TLDR
An online system for opinion mining and retrieval that is able to discover up-to-date web pages on given topics using focused crawling agents, extract opinionated textual parts from web pages, and estimate their polarity using opinion mining agents is presented.
...
1
2
3
4
5
...