Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge
TLDR
A new question set, text corpus, and baselines assembled to encourage AI research in advanced question answering constitute the AI2 Reasoning Challenge (ARC), which requires far more powerful knowledge and reasoning than previous challenges such as SQuAD or SNLI.
From 'F' to 'A' on the N.Y. Regents Science Exams: An Overview of the Aristo Project
TLDR
Unprecedented success on the Grade 8 New York Regents Science Exam, where for the first time a system scores more than 90% on the exam's non-diagram, multiple choice (NDMC) questions, demonstrates that modern NLP methods can result in mastery on this task.
Moving beyond the Turing Test with the Allen AI Science Challenge
Answering questions correctly from standardized eighth-grade science tests is itself a test of machine intelligence.
Behavioral Explanations Underlying the Lack of Trap Effectiveness for Small-Scale Management of Japanese Beetles (Coleoptera: Scarabaeidae)
TLDR
Traps containing a combination floral and synthetic pheromone lure are used to monitor and manage Japanese beetles, Popillia japonica Newman. It is hypothesized that trap spillover results from arriving females being less attracted to the precise location of the trap than to the general location itself.
Environmental Conditions Affect Sperm Competition Risk in Japanese Beetles (Coleoptera: Scarabaeidae)
TLDR
The results demonstrate that environmental conditions can have a major effect on sperm competition risk and suggest that males would benefit greatly from being able to mitigate, either morphologically, physiologically, or behaviorally, the impact of those conditions.
Potential and realized reproduction by different worker castes in queen-less and queen-right colonies of Pogonomyrmex badius
TLDR
An assessment of the reproductive potential of workers of each caste group (minors and majors) in the presence and absence of the queen, and of whether males produced in natural queen-right colonies are derived from workers, found no evidence that workers produce males in queen-right colonies, suggesting that worker reproduction is controlled via some social mechanism (self-restraint, policing, or inhibition).
Think you have Solved Direct-Answer Question Answering? Try ARC-DA, the Direct-Answer AI2 Reasoning Challenge
TLDR
The ARC-DA dataset is presented: a direct-answer (“open response”, “freeform”) version of the ARC (AI2 Reasoning Challenge) multiple-choice dataset, and one of the first DA datasets of natural questions that often require reasoning and where appropriate question decompositions are not evident from the questions themselves.
SUPP.AI: finding evidence for supplement-drug interactions
TLDR
SUPP.AI is an attempt to close the information gap on dietary supplements by making up-to-date evidence on SDIs more discoverable for researchers, clinicians, and consumers.
Extracting evidence of supplement-drug interactions from literature
TLDR
An automated method for extracting evidence of supplement-drug interactions (SDIs) from scientific text is demonstrated, showing that learning the task of drug-drug interaction (DDI) classification transfers successfully to the related problem of SDI classification.