Proceedings of NIPS
Diffusion of Lexical Change in Social Media
TLDR
A latent vector autoregressive model aggregates across thousands of words to identify high-level patterns in the diffusion of linguistic change over the United States, offering support for prior arguments that focus on geographical proximity and population size.
Proceedings of EMNLP
Random Feature Attention
TLDR
RFA, a linear time and space attention that uses random feature methods to approximate the softmax function, is proposed and explored, showing that RFA is competitive in terms of both accuracy and efficiency on three long text classification datasets.
The Right Tool for the Job: Matching Model and Instance Complexities
TLDR
This work proposes a modification to contextual representation fine-tuning which allows for an early (and fast) “exit” from neural network calculations for simple instances, and late (and accurate) exit for hard instances during inference.
Evaluating Models’ Local Decision Boundaries via Contrast Sets
TLDR
This work proposes a more rigorous annotation paradigm for NLP that helps close systematic gaps in the test data, recommending that dataset authors manually perturb the test instances in small but meaningful ways that (typically) change the gold label, creating contrast sets.
Bayesian Optimization of Text Representations
TLDR
This work applies a sequential model-based optimization technique and shows that this method makes standard linear models competitive with more sophisticated, expensive state-of-the-art methods based on latent variable models or neural networks on various topic classification and sentiment analysis problems.
Shortformer: Better Language Modeling using Shorter Inputs
TLDR
This work shows that initially training the model on short subsequences, before moving on to longer ones, both reduces overall training time and substantially improves perplexity on WikiText-103, without adding any parameters.
The Usable Privacy Policy Project: Combining Crowdsourcing, Machine Learning and Natural Language Processing to Semi-Automatically Answer Those Privacy Questions Users Care About
Natural language privacy policies have become a de facto standard to address expectations of “notice and choice” on the Web. However, users generally do not read these policies and those who do read ...