Share This Author
Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge
A new question set, text corpus, and baselines assembled to encourage AI research in advanced question answering constitute the AI2 Reasoning Challenge (ARC), which requires far more powerful knowledge and reasoning than previous challenges such as SQuAD or SNLI.
From 'F' to 'A' on the N.Y. Regents Science Exams: An Overview of the Aristo Project
Success is reported on the Grade 8 New York Regents Science Exam, where for the first time a system scores more than 90 percent on the exam’s nondiagram, multiple choice (NDMC) questions, demonstrating that modern natural language processing methods can result in mastery on this task.
Moving beyond the Turing Test with the Allen AI Science Challenge
- Carissa Schoenick, Peter Clark, Oyvind Tafjord, Peter D. Turney, Oren Etzioni
- EducationCommun. ACM
- 14 April 2016
Answering questions correctly from standardized eighth-grade science tests is itself a test of machine intelligence.
Behavioral Explanations Underlying the Lack of Trap Effectiveness for Small-Scale Management of Japanese Beetles (Coleoptera: Scarabaeidae)
- P. Switzer, P. C. Enstrom, Carissa Schoenick
- Environmental ScienceJournal of economic entomology
- 28 May 2009
T traps containing a combination floral and synthetic pheromone lure are used to monitor and manage Japanese beetles, Popillia japonica Newman, and it is hypothesized that trap spillover is a result of arriving females not being as attracted to the precise location of the trap as they are to the general location itself.
Environmental Conditions Affect Sperm Competition Risk in Japanese Beetles (Coleoptera: Scarabaeidae)
The results demonstrate that environmental conditions can have a major effect on sperm competition risk and suggest that males would benefit greatly from being able to mitigate, either morphologically, physiologically, or behaviorally, the impact of those conditions.
Think you have Solved Direct-Answer Question Answering? Try ARC-DA, the Direct-Answer AI2 Reasoning Challenge
The ARC-DA dataset is presented, a direct-answer (“open response”, “freeform”) version of the ARC (AI2 Reasoning Challenge) multiple-choice dataset, one of the first DA datasets of natural questions that often require reasoning, and where appropriate question decompositions are not evident from the questions themselves.
Potential and realized reproduction by different worker castes in queen-less and queen-right colonies of Pogonomyrmex badius
Assessment of the reproductive potential of workers of each caste group, minors and majors, in the presence and absence of the queen, and whether males produced in natural queen-right colonies are derived from workers found no evidence that they do so, suggesting that worker reproduction is controlled via some social mechanism (self restraint, policing, or inhibition).
SUPP.AI: finding evidence for supplement-drug interactions
SUPP.AI is an attempt to close the information gap on dietary supplements by making up-to-date evidence on SDIs more discoverable for researchers, clinicians, and consumers.
Extracting evidence of supplement-drug interactions from literature
An automated method for extracting evidence of supplement-drug interactions (SDIs) from scientific text is demonstrated, demonstrating that learning the task of DDI classification transfers successfully to the related problem of SDI classification.
Iconary: A Pictionary-Based Game for Testing Multimodal Communication with Drawings and Text
This work proposes models to play Iconary, a collaborative game of drawing and guessing based on Pictionary, that poses a novel challenge for the research community and proposes models that are skillful players and able to employ world knowledge in language models toplay with words unseen during training.