• Publications
  • Influence
Hateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter
TLDR
We provide a list of criteria founded in critical race theory, and use them to annotate a publicly available corpus of more than 16k tweets annotated for hate speech. Expand
  • 590
  • 108
  • PDF
Learning Whom to Trust with MACE
TLDR
We build a generative model of the annotation process that learns in an unsupervised fashion to identify which annotators are trustworthy and predict the correct underlying labels. Expand
  • 195
  • 45
  • PDF
Identifying Metaphorical Word Use with Tree Kernels
TLDR
A metaphor is a figure of speech that refers to one concept in terms of another, as in “He is such a sweet person”. Expand
  • 71
  • 15
  • PDF
Personality Traits on Twitter - or - How to Get 1, 500 Personality Tests in a Week
TLDR
We analyze which features are predictive of which personality traits, and present a novel corpus of 1.2M English tweets annotated with Myers-Briggs personality type and gender. Expand
  • 96
  • 10
  • PDF
Demographic Factors Improve Classification Performance
TLDR
We investigate the effect of including demographic information on performance in a variety of text-classification tasks in five languages. Expand
  • 110
  • 8
  • PDF
SemEval-2016 Task 10: Detecting Minimal Semantic Units and their Meanings (DiMSUM)
TLDR
This task combines the labeling of multiword expressions and supersenses (coarse-grained classes) in an explicit, yet broad-coverage paradigm for lexical semantics. Expand
  • 45
  • 8
  • PDF
What's in a Preposition? Dimensions of Sense Disambiguation for an Interesting Word Class
TLDR
We examine the parameters that must be considered in preposition disambiguation, namely context, features, and granularity. Expand
  • 44
  • 8
  • PDF
User Review Sites as a Resource for Large-Scale Sociolinguistic Studies
TLDR
We explore a large new data source, international review websites with user profiles. Expand
  • 55
  • 7
  • PDF
Predictive Biases in Natural Language Processing Models: A Conceptual Framework and Overview
TLDR
An increasing number of works in natural language processing have addressed the effect of bias on the predicted outcomes, introducing mitigation techniques that act on different parts of the standard NLP pipeline (data and models). Expand
  • 38
  • 7
  • PDF
Multi-Task Learning for Mental Health using Social Media Text
TLDR
We introduce initial groundwork for estimating suicide risk and mental health in a deep learning framework. Expand
  • 53
  • 6
  • PDF
...
1
2
3
4
5
...