• Publications
  • Influence
AllenNLP Interpret: A Framework for Explaining Predictions of NLP Models
TLDR
We introduce AllenNLP Interpret, a flexible framework for interpreting NLP models. Expand
  • 37
  • 4
  • PDF
Evaluating Models' Local Decision Boundaries via Contrast Sets
TLDR
We propose a new annotation paradigm for NLP that helps to close systematic gaps in the test data. Expand
  • 32
  • 2
  • PDF
Evaluating NLP Models via Contrast Sets
TLDR
We propose a new annotation paradigm for NLP that helps to close systematic gaps in the test data. Expand
  • 51
  • 1
  • PDF
An Improved Neural Baseline for Temporal Relation Extraction
TLDR
This paper proposes a new neural system that achieves about 10% absolute improvement in accuracy over the previous best system (25% error reduction) on two benchmark datasets. Expand
  • 12
  • 1
  • PDF
Improving Generalization in Coreference Resolution via Adversarial Training
TLDR
In order for coreference resolution systems to be useful in practice, they must be able to generalize to new text. Expand
  • 6
  • PDF
Obtaining Faithful Interpretations from Compositional Neural Networks
TLDR
We introduce the concept of module-wise faithfulness, a systematic evaluation of faithfulness in neural module networks (NMNs) for visual and textual reasoning. Expand
  • 11
  • PDF
Correlation Clustering with Same-Cluster Queries Bounded by Optimal Cost
TLDR
We present algorithms for correlation clustering whose error and query bounds are parameterized by $C_{OPT}$ rather than by the number of clusters. Expand
  • 4
  • PDF
Evaluation of named entity coreference
TLDR
We introduce new metrics for evaluating named entity coreference that address these discrepancies and show that for the comparisons of competitive systems, standard coreference evaluations could give misleading results for this task. Expand
  • 4
  • PDF
Latent Compositional Representations Improve Systematic Generalization in Grounded Question Answering
TLDR
Answering questions that involve multi-step reasoning requires decomposing them and using the answers of intermediate steps to reach the final answer. Expand
  • 4
  • PDF
Analyzing Compositionality in Visual Question Answering
TLDR
We analyze the performance of one transformer model, LXMERT, on the NLVR2 and GQA datasets. Expand
  • 3
  • PDF
...
1
2
...