• Publications
  • Influence
Deep learning for pharmacovigilance: recurrent neural network architectures for labeling adverse drug reactions in Twitter posts
Objective Social media is an important pharmacovigilance data source for adverse drug reaction (ADR) identification. Human review of social media data is infeasible due to data quantity, thus naturalExpand
  • 76
  • 10
Mapping the Paraphrase Database to WordNet
WordNet has facilitated important research in natural language processing but its usefulness is somewhat limited by its relatively small lexical coverage. The Paraphrase Database (PPDB) covers 650Expand
  • 3
  • 2
  • PDF
Effectively Crowdsourcing Radiology Report Annotations
Crowdsourcing platforms are a popular choice for researchers to gather text annotations quickly at scale. We investigate whether crowdsourced annotations are useful when the labeling task requiresExpand
  • 7
  • 1
  • PDF
Crowd control: Effectively utilizing unscreened crowd workers for biomedical data annotation
Annotating unstructured texts in Electronic Health Records data is usually a necessary step for conducting machine learning research on such datasets. Manual annotation by domain experts providesExpand
  • 12
  • 1
Learning Scalar Adjective Intensity from Paraphrases
Adjectives like warm, hot, and scalding all describe temperature but differ in intensity. Understanding these differences between adjectives is a necessary part of reasoning about natural language.Expand
  • 5
  • 1
  • PDF
Clustering Paraphrases by Word Sense
Automatically generated databases of English paraphrases have the drawback that they return a single list of paraphrases for an input word or phrase. This means that all senses of polysemous wordsExpand
  • 13
  • PDF
The Language of Place: Semantic Value from Geospatial Context
There is a relationship between what we say and where we say it. Word embeddings are usually trained assuming that semantically-similar words occur within the same textual contexts. We investigateExpand
  • 12
  • PDF
Word Sense Filtering Improves Embedding-Based Lexical Substitution
The role of word sense disambiguation in lexical substitution has been questioned due to the high performance of vector space models which propose good substitutes without explicitly accounting forExpand
  • 7
  • PDF
Automated Paraphrase Lattice Creation for HyTER Machine Translation Evaluation
We propose a variant of a well-known machine translation (MT) evaluation metric, HyTER (Dreyer and Marcu, 2012), which exploits reference translations enriched with meaning equivalent expressions.Expand
  • 5
  • PDF
A Comparison of Context-sensitive Models for Lexical Substitution
Word embedding representations provide good estimates of word meaning and give state-of-the art performance in semantic tasks. Embedding approaches differ as to whether and how they account for theExpand
  • 5
  • PDF