Construction of the Literature Graph in Semantic Scholar
TLDR
We describe a deployed scalable system for organizing published scientific literature into a heterogeneous graph to facilitate algorithmic manipulation and discovery.
  • 128
  • 17
From 'F' to 'A' on the N.Y. Regents Science Exams: An Overview of the Aristo Project
TLDR
This paper reports unprecedented success on the Grade 8 New York Regents Science Exam, where for the first time a system scores more than 90% on the exam's non-diagram, multiple choice (NDMC) questions.
  • 21
  • 2
IKE - An Interactive Tool for Knowledge Extraction
TLDR
We present IKE, a new extraction tool that performs fast, interactive bootstrapping to develop high-quality extraction patterns for targeted relations.
  • 21
  • 1
A Simple Yet Strong Pipeline for HotpotQA
TLDR
We show that a simple model, QUARK (see Fig. 1), which first identifies relevant sentences from each paragraph independently of other paragraphs, is surprisingly powerful on this task.
  • 8
  • 1
Documenting the English Colossal Clean Crawled Corpus
As language models are trained on ever more text, researchers are turning to some of the largest corpora available. Unlike most other types of datasets in NLP, large unlabeled text corpora are often...