• Corpus ID: 3218493

Sarcasm Detection on Czech and English Twitter

@inproceedings{Ptcek2014SarcasmDO,
  title={Sarcasm Detection on Czech and English Twitter},
  author={Tom{\'a}s Pt{\'a}cek and Ivan Habernal and Jun Hong},
  booktitle={COLING},
  year={2014}
}
This paper presents a machine learning approach to sarcasm detection on Twitter in two languages – English and Czech. [] Key Result Experiments show that our language-independent approach significantly outperforms adapted state-of-the-art methods in English (F-measure 0.947) and also represents a strong baseline for further research in Czech (F-measure 0.582).

Tables from this paper

Sarcasm Annotation and Detection in Tweets
TLDR
A comparison of three sets of tweets marked for sarcasm, two annotated manually and one annotated using the common strategy of relying on the authors correctly using hashtags to mark sarcasm indicates that using hashtagged is not a reliable approach to creating Twitter sarcasm corpora.
Sentence-Level Sarcasm Detection in English and Filipino Tweets
TLDR
This study focused on sentence-level sarcasm detection that includes lexical, pragmatic, hyperbole, quotations and punctuation marks as the features extracted in the tweets.
A Corpus of English-Hindi Code-Mixed Tweets for Sarcasm Detection
TLDR
This work presents the first English-Hindi code-mixed dataset of tweets marked for presence of sarcasm and irony where each token is also annotated with a language tag and presents a baseline su- pervised classification system developed using the same dataset.
Sarcasm detection in social media posts: A division of sentiment analysis
TLDR
The paper is based on the re-search done to find methods to detect sarcasm in the social media posts and a comparison between different machine learning algorithms to choose for sarcasm detection.
Arabic sarcasm detection in Twitter
TLDR
A classifier model that detects Arabic-sarcasm tweets by classifying them as sarcastic by setting some features that may declare a tweet as sarcastic using Weka is proposed.
Automatic Sarcasm Detection in Twitter Messages
TLDR
A system for automatic sarcasm detection in Twitter messages is presented and a dataset of tweets manually annotated with respect to the presence of sarcasm was built; the result was very similar to that of a previously made set, and both of them showed considerable deviation from automatic annotation.
From Arabic Sentiment Analysis to Sarcasm Detection: The ArSarcasm Dataset
TLDR
ArSarcasm is an Arabic sarcasm detection dataset, which was created through the reannotation of available Arabic sentiment analysis datasets, which contains 10,547 tweets, 16% of which are sarcastic.
Natural language processing based features for sarcasm detection: An investigation using bilingual social media texts
TLDR
A feature extraction process is proposed to detect sarcasm using bilingual texts; more specifically public comments on economic related posts on Facebook.
Sarcasm Detection in Twitter - Performance Impact while using Data Augmentation: Word Embeddings
TLDR
A contextual model for sarcasm identification in twitter using RoBERTa, and augmenting the dataset by applying Global Vector representation (GloVe) for the construction of word embedding and context learning to generate more data and balancing the dataset is proposed.
...
...

References

SHOWING 1-10 OF 37 REFERENCES
Sentiment Analysis in Czech Social Media Using Supervised Machine Learning
TLDR
An in-depth research of machine learning methods for sentiment analysis of Czech social media using different pre-processing techniques and employ various features and classifiers, which will extend the current sentiment analysis research to another family of languages and encourage competition which potentially leads to the production of high-end commercial solutions.
Semi-supervised recognition of sarcastic sentences in Twitter and Amazon
TLDR
This paper experiments with semi-supervised sarcasm identification on two very different data sets: a collection of 5.9 million tweets collected from Twitter, and aCollection of 66000 product reviews from Amazon.
The perfect solution for detecting sarcasm in tweets #not
TLDR
It is hypothesize that explicit markers such as hashtags are the digital extralinguistic equivalent of nonverbal expressions that people employ in live interaction when conveying sarcasm, and shows that sarcasm is often signalled by hyperbole, using intensifiers and exclamations; in contrast, non-hyperbolic sarcastic messages often receive an explicit marker.
Some clues on irony detection in tweets
TLDR
This study analyzes and implements some patterns that may be associated to ironic statements in Brazilian Portuguese that are based on a pragmatic view, in which the context is organized as the reader is in touch with the text.
Identifying Sarcasm in Twitter: A Closer Look
TLDR
This work reports on a method for constructing a corpus of sarcastic Twitter messages in which determination of the sarcasm of each message has been made by its author and uses this reliable corpus to compare sarcastic utterances in Twitter to utterances that express positive or negative attitudes without sarcasm.
A multidimensional approach for detecting irony in Twitter
TLDR
A new model of irony detection that is assessed along two dimensions: representativeness and relevance is constructed, and initial results are largely positive, and provide valuable insights into the figurative issues facing tasks such as sentiment analysis, assessment of online reputations, or decision making.
ICWSM - A Great Catchy Name: Semi-Supervised Recognition of Sarcastic Sentences in Online Product Reviews
TLDR
A novel Semi-supervised Algorithm for Sarcasm Identification that recognizes sarcastic sentences in product reviews and speculate on the motivation for using sarcasm in online communities and social networks is presented.
Sarcasm as Contrast between a Positive Sentiment and Negative Situation
TLDR
This work develops a sarcasm recognizer that automatically learns lists of positive sentiment phrases and negative situation phrases from sarcastic tweets and shows that identifying contrasting contexts using the phrases learned through bootstrapping yields improved recall for sarcasm recognition.
From humor recognition to irony detection: The figurative language of social media
Unsupervised Improving of Sentiment Analysis Using Global Target Context
TLDR
This work assumes that sentiment labels of reviews about the same target are often consistent in some way and model this consistency by Dirichlet distribution over sentiment labels and uses it together with Maximum entropy classifier to gain significant improvement.
...
...