• Publications
Neural Motifs: Scene Graph Parsing with Global Context
TLDR
We investigate scene graph parsing: the task of producing graph representations of real-world images that provide semantic summaries of objects and relationships.
SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference
TLDR
We introduce Adversarial Filtering (AF), a novel procedure that constructs a de-biased dataset by iteratively training an ensemble of stylistic classifiers and using them to filter the data.
Defending Against Neural Fake News
TLDR
We present a new generative model for controllable text generation called Grover.
From Recognition to Cognition: Visual Commonsense Reasoning
TLDR
We present a new reasoning engine, Recognition to Cognition Networks (R2C), that models the necessary layered inferences for grounding, contextualization, and reasoning.
Multimodal Sentiment Intensity Analysis in Videos: Facial Gestures and Verbal Messages
TLDR
We introduce the first multimodal dataset with opinion-level sentiment intensity annotations, study the prototypical interaction patterns between facial gestures and spoken words when inferring sentiment intensity, propose a new computational representation based on a language-gesture study, and evaluate the proposed approach in a speaker-independent paradigm for sentiment intensity prediction.
MOSI: Multimodal Corpus of Sentiment Intensity and Subjectivity Analysis in Online Opinion Videos
TLDR
We present the first opinion-level annotated corpus for sentiment and subjectivity analysis in online videos, the Multimodal Opinion-level Sentiment Intensity dataset (MOSI).
HellaSwag: Can a Machine Really Finish Your Sentence?
TLDR
We show that commonsense inference remains difficult even for state-of-the-art models by presenting HellaSwag, a new challenge dataset.
PIQA: Reasoning about Physical Commonsense in Natural Language
TLDR
We introduce the task of physical commonsense reasoning and a corresponding benchmark dataset, Physical Interaction: Question Answering (PIQA), to evaluate language representations on their knowledge of physical commonsense.
Adversarial Filters of Dataset Biases
TLDR
We investigate one recently proposed approach, AFLite, which adversarially filters out dataset biases, as a means to mitigate the prevalent overestimation of machine performance.
Zero-Shot Activity Recognition with Verb Attribute Induction
TLDR
In this paper, we investigate large-scale zero-shot activity recognition by modeling the visual and linguistic attributes of action verbs.