RoBERTa: A Robustly Optimized BERT Pretraining Approach
TLDR
We present a replication study of BERT pretraining (Devlin et al., 2019) that carefully measures the impact of many key hyperparameters and training data size.
  • 2,902
  • 922
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension
TLDR
We present BART, a denoising autoencoder for pretraining sequence-to-sequence models.
  • 629
  • 162
HMDB: the Human Metabolome Database
TLDR
The Human Metabolome Database (HMDB) is currently the most complete and comprehensive curated collection of human metabolite and human metabolism data in the world.
  • 2,032
  • 116
End-to-end Neural Coreference Resolution
TLDR
We introduce the first end-to-end coreference resolution model and show that it significantly outperforms all previous work without using a syntactic parser or hand-engineered mention detector.
  • 413
  • 95
Hierarchical Neural Story Generation
TLDR
We tackle the challenges of story-telling with a hierarchical model, which first generates a sentence called the prompt describing the topic for the story, and then conditions on this prompt when generating the story.
  • 335
  • 77
Deep Semantic Role Labeling: What Works and What's Next
TLDR
We introduce a new deep learning model for semantic role labeling (SRL) that significantly improves the state of the art, along with detailed analyses to reveal its strengths and limitations.
  • 307
  • 63
Deal or No Deal? End-to-End Learning of Negotiation Dialogues
TLDR
We gather a large dataset of human-human negotiations on a multi-issue bargaining task, where agents who cannot observe each other's reward functions must reach an agreement (or a deal) via natural language dialogue.
  • 191
  • 34
Question-Answer Driven Semantic Role Labeling: Using Natural Language to Annotate Natural Language
TLDR
This paper introduces the task of question-answer driven semantic role labeling (QA-SRL), where question-answer pairs are used to represent predicate-argument structure.
  • 117
  • 25
Cross-lingual Transfer Learning for Multilingual Task Oriented Dialog
TLDR
We present a new data set of 57k annotated utterances in English (43k), Spanish (8.6k) and Thai (5k) across the domains weather, alarm, and reminder.
  • 80
  • 21
A* CCG Parsing with a Supertag-factored Model
TLDR
We introduce a new CCG parsing model which is factored on lexical category assignments.
  • 115
  • 20