• Publications
  • Influence
Sarcasm Detection on Czech and English Twitter
TLDR
This work is the first attempt at sarcasm detection in the Czech language by creating a large Czech Twitter corpus consisting of 7,000 manually-labeled tweets and evaluating two classifiers with various combinations of features on both the Czech and English datasets.
The Argument Reasoning Comprehension Task: Identification and Reconstruction of Implicit Warrants
TLDR
A methodology for reconstructing warrants systematically is developed and operationalized in a scalable crowdsourcing process, resulting in a freely licensed dataset with warrants for 2k authentic arguments from news comments.
Which argument is more convincing? Analyzing and predicting convincingness of Web arguments using bidirectional LSTM
TLDR
This work annotates a large datasets of 16k pairs of arguments over 32 topics and investigates whether the relation “A is more convincing than B” exhibits properties of total ordering; these findings are used as global constraints for cleaning the crowdsourced data.
What is the Essence of a Claim? Cross-Domain Claim Identification
TLDR
While the divergent conceptualization of claims in different datasets is indeed harmful to cross-domain classification, it is shown that there are shared properties on the lexical level as well as system configurations that can help to overcome these gaps.
Argumentation Mining in User-Generated Web Discourse
TLDR
The findings show that argumentation mining in user-generated Web discourse is a feasible but challenging task and offers the data, source codes, and annotation guidelines to the community under free licenses.
Argumentation Mining on the Web from Information Seeking Perspective
TLDR
It is argued that an annotation scheme for argumentation mining is a function of the task requirements and the corpus properties and it is found that the choice of the argument components to be annotated strongly depends on the register, the length of the document, and inherently on the literary devices and structures used for expressing argumentation.
Exploiting Debate Portals for Semi-Supervised Argumentation Mining in User-Generated Web Discourse
TLDR
Novel features that exploit clustering of unlabeled data from debate portals based on a word embeddings representation are proposed that significantly outperform several baselines in the cross-validation, cross-domain, and cross-register evaluation scenarios.
What makes a convincing argument? Empirical analysis and detecting attributes of convincingness in Web argumentation
TLDR
This article tackles a new challenging task in computational argumentation by annotating 26k explanations of convincingness of arguments in the given argument pair, such as their strengths or flaws, by creating a new crowd-sourced corpus containing 9,111 argument pairs.
SemEval-2018 Task 12: The Argument Reasoning Comprehension Task
TLDR
The analysis indicates that an inclusion of external knowledge is key to reasoning comprehension, and the results reveal the complexity of the task, with many approaches hardly improving over the random accuracy of about 0.5.
...
...