• Publications
  • Influence
Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments
We address the problem of part-of-speech tagging for English data from the popular micro-blogging service Twitter. We develop a tagset, annotate data, develop features, and report tagging results...
  • 950
  • 109
From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series
We connect measures of public opinion measured from polls with sentiment measured from text. We analyze several surveys on consumer confidence and political opinion over the 2008 to 2009 period, and...
  • 1,722
  • 100
Improved Part-of-Speech Tagging for Online Conversational Text with Word Clusters
We consider the problem of part-of-speech tagging for informal, online conversational text. We systematically evaluate the use of large-scale unsupervised word clustering and new lexical features to...
  • 702
  • 90
Transition-Based Dependency Parsing with Stack Long Short-Term Memory
This work was sponsored in part by the U.S. Army Research Laboratory and the U.S. Army Research Office under contract/grant number W911NF-10-1-0533, and in part by NSF CAREER grant...
  • 598
  • 76
A Latent Variable Model for Geographic Lexical Variation
The rapid growth of geotagged social media raises new computational possibilities for investigating geographic linguistic variation. In this paper, we present a multi-level generative model that...
  • 618
  • 65
Recurrent Neural Network Grammars
We introduce recurrent neural network grammars, probabilistic models of sentences with explicit phrase structure. We explain efficient inference procedures that allow application to both parsing and...
  • 317
  • 65
A Simple, Fast, and Effective Reparameterization of IBM Model 2
We present a simple log-linear reparameterization of IBM Model 2 that overcomes problems arising from Model 1’s strong assumptions and Model 2’s overparameterization. Efficient inference, likelihood...
  • 550
  • 63
A Discriminative Graph-Based Parser for the Abstract Meaning Representation
Abstract Meaning Representation (AMR) is a semantic formalism for which a growing set of annotated examples is available. We introduce the first approach to parse sentences into this representation...
  • 228
  • 62
Annotation Artifacts in Natural Language Inference Data
Large-scale datasets for natural language inference are created by presenting crowd workers with a sentence (premise), and asking them to generate three new sentences (hypotheses) that it entails,...
  • 302
  • 61
The Web as a Parallel Corpus
Parallel corpora have become an essential resource for work in multilingual natural language processing. In this article, we report on our work using the STRAND system for mining parallel text on the...
  • 627
  • 60