• Publications
  • Influence
CORD-19: The Covid-19 Open Research Dataset
TLDR
We describe the mechanics of dataset construction, highlighting challenges and key design decisions, provide an overview of how CORD-19 has been used, and preview tools and upcoming shared tasks built around the dataset. Expand
  • 198
  • 31
  • PDF
Construction of the Literature Graph in Semantic Scholar
TLDR
We describe a deployed scalable system for organizing published scientific literature into a heterogeneous graph to facilitate algorithmic manipulation and discovery. Expand
  • 128
  • 17
  • PDF
A Dataset of Peer Reviews (PeerRead): Collection, Insights and NLP Applications
TLDR
We present the first public dataset of scientific peer reviews available for research purposes (PeerRead) providing an opportunity to study this important artifact. Expand
  • 60
  • 16
  • PDF
Overview of the TREC 2019 Fair Ranking Track
TLDR
The goal of the TREC Fair Ranking track was to develop a benchmark for evaluating retrieval systems in terms of fairness to different content providers in addition to classic notions of relevance. Expand
  • 5
  • PDF
Mitigating Biases in CORD-19 for Analyzing COVID-19 Literature
TLDR
A framework to examine biases in scientific document collections like CORD-19 by comparing their properties with those derived from the citation behaviors of the entire scientific community. Expand
  • 3