Construction of the Literature Graph in Semantic Scholar
TLDR
This paper reduces literature graph construction to familiar NLP tasks, points out research challenges due to differences from standard formulations of these tasks, and reports empirical results for each task.
A Dataset of Peer Reviews (PeerRead): Collection, Insights and NLP Applications
TLDR
The first public dataset of scientific peer reviews available for research purposes (PeerRead v1) is presented and it is shown that simple models can predict whether a paper is accepted with up to 21% error reduction compared to the majority baseline.
Structural Scaffolds for Citation Intent Classification in Scientific Publications
TLDR
This work proposes structural scaffolds, a multitask model to incorporate structural information of scientific papers into citations for effective classification of citation intents, which achieves a new state-of-the-art on an existing ACL Anthology dataset with a 13.3% absolute increase in F1 score.
Fact or Fiction: Verifying Scientific Claims
We introduce scientific claim verification, a new task to select abstracts from the research literature containing evidence that supports or refutes a given scientific claim, and to identify...
SciREX: A Challenge Dataset for Document-Level Information Extraction
TLDR
SciREX is introduced, a document-level IE dataset that encompasses multiple IE tasks, including salient entity identification and document-level N-ary relation identification from scientific articles, and a neural model is developed as a strong baseline that extends previous state-of-the-art IE models to document-level IE.
Apoptosis-related genes control autophagy and influence DENV-2 infection in the mosquito vector, Aedes aegypti.
TLDR
Evidence is provided that apoptosis-related genes are also involved in regulating autophagy, and that Aedronc may play an important role in DENV-2 infection success in Ae.
Quantifying Sex Bias in Clinical Studies at Scale With Automated Data Extraction
TLDR
It is suggested that sex bias against female participants in clinical studies persists, but results differ when studies vs participants are the measurement units.
Extracting a Knowledge Base of Mechanisms from COVID-19 Papers
TLDR
This work pursues the construction of a knowledge base of mechanisms, a fundamental concept across the sciences that encompasses activities, functions and causal relations, ranging from cellular processes to economic impacts, by developing a broad, unified schema.
MS2: Multi-Document Summarization of Medical Studies
TLDR
This work releases MS^2 (Multi-Document Summarization of Medical Studies), a dataset of over 470k documents and 20k summaries derived from the scientific literature that facilitates the development of systems that can assess and aggregate contradictory evidence across multiple studies, and is the first large-scale, publicly available multi-document summarization dataset in the biomedical domain.
Improving the Accessibility of Scientific Documents: Current State, User Needs, and a System Solution to Enhance Scientific PDF Accessibility for Blind and Low Vision Users
TLDR
A small sample of papers was evaluated for successful extraction of display equations and categories of paper objects identified for evaluation along with the common errors seen for each category, including semantic categories and common extraction errors.