Augmenting Scientific Papers with Just-in-Time, Position-Sensitive Definitions of Terms and Symbols

@article{Head2021AugmentingSP,
  title={Augmenting Scientific Papers with Just-in-Time, Position-Sensitive Definitions of Terms and Symbols},
  author={Andrew Head and Kyle Lo and Dongyeop Kang and Raymond Fok and Sam Skjonsberg and Daniel S. Weld and Marti A. Hearst},
  journal={Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems},
  year={2021}
}
  • Andrew Head, Kyle Lo, Marti A. Hearst
  • Published 29 September 2020
  • Computer Science
  • Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems
Despite the central importance of research papers to scientific progress, they can be difficult to read. Comprehension is often stymied when the information needed to understand a passage resides somewhere else—in another section, or in another paper. In this work, we envision how interfaces can bring definitions of technical terms and symbols to readers when and where they need them most. We introduce ScholarPhi, an augmented reading interface with four novel features: (1) tooltips that… 

Figures and Tables from this paper

Document-Level Definition Detection in Scholarly Documents: Existing Models, Error Analyses, and Future Directions
TLDR
This paper develops a new definition detection system, HEDDEx, that utilizes syntactic features, transformer encoders, and heuristic filters, and evaluates it on a standard sentence-level benchmark and notes that performance on the high-recall document-level task is much lower than in the standard evaluation approach.
PAWLS: PDF Annotation With Labels and Structure
TLDR
This paper presents PDF Annotation with Labels and Structure (PAWLS), a new annotation tool designed specifically for the PDF document format, particularly suited for mixed-mode annotation and scenarios in which annotators require extended context to annotate accurately.
CiteRead: Integrating Localized Citation Contexts into Scientific Paper Reading
TLDR
A novel paper reading experience that integrates relevant information about follow-on work directly into a paper, allowing readers to learn about newer papers and see how a paper is discussed by its citing papers in the context of the reference paper.
Paper Plain: Making Medical Research Papers Approachable to Healthcare Consumers with Natural Language Processing
TLDR
The study results suggest that guiding readers to relevant passages and providing plain language summaries, or “gists,” alongside the original paper content can make reading medical papers easier and give readers more confidence to approach these papers.
A Computational Inflection for Scientific Discovery
TLDR
The confluence of societal and computational trends suggests that computer science is poised to ignite a revolution in the scientific process itself.
From Who You Know to What You Read: Augmenting Scientific Recommendations with Implicit Social Networks
TLDR
This work introduces multiple new methods for augmenting recommendations with textual relevance messages that highlight knowledge-graph connections between recommended papers and a user’s publication and interaction history and develops a novel method that highlights connections with proxy authors of interest to users.
Hammer PDF: An Intelligent PDF Reader for Scientific Papers
TLDR
Hammer PDF is introduced, a novel intelligent PDF Reader for scientific papers that helps researchers, especially those who study computer science, to improve the efficiency and experience of reading scientific papers.
Math Augmentation: How Authors Enhance the Readability of Formulas using Novel Visual Design Practices
TLDR
A comprehensive inventory of the kinds of maugs that appear in math documents, and a detailed account of how authors’ tools ought to be redesigned to support efficient creation of math augmentations are contributed.
Recentering Reframing as an RtD Contribution: The Case of Pivoting from Accessible Web Tables to a Conversational Internet
TLDR
This paper describes a case where through a series of three design experiments the authors experienced a research pivot and described how their work to improve web-table navigation for screen-reader users broke their frame, leading to a new research program focused on constructing a conversational internet.
Revise and Resubmit: An Intertextual Model of Text-based Collaboration in Peer Review
TLDR
This work proposes the first intertextual model of text-based collaboration, which encompasses three major phenomena that make up a full iteration of the review-revise-and-resubmit cycle: pragmatic tagging, linking and long-document version alignment.
...
1
2
...

References

SHOWING 1-10 OF 120 REFERENCES
Beyond paper: supporting active reading with free form digital ink annotations
TLDR
The XLibris “active reading machine” demonstrates that computers can help active readers organize and find information while retaining many of the advantages of reading on paper.
Enriching a document collection by integrating information extraction and PDF annotation
TLDR
A high-accuracy citation extraction algorithm which significantly improves on earlier reported techniques, and a technique for integrating PDF processing with a conventional text-stream based information extraction pipeline.
Mathematical Language Processing Project
TLDR
Two approaches to discover identifier-definition tuples are compared and a simple pattern matching approach is used and a approach that uses part-of-speech tag based distances as well as sentence positions to calculate identifier- definition probabilities is presented.
PaperQuest: A Visualization Tool to Support Literature Review
TLDR
PaperQuest is presented, a visualization tool that supports efficient reading decisions, by only displaying the information useful at a given step of the review, in order to find and sort papers that are likely to be relevant to users.
SideNoter: Scholarly Paper Browsing System based on PDF Restructuring and Text Annotation
TLDR
This system provides ways to extract natural language sentences from PDF files together with their logical structures, and also to map arbitrary textual spans to their corresponding regions on page images, and is planned to make widely available to NLP researchers.
The reader's helper: a personalized document reading environment
TLDR
A anew document reading environment is introduced called the Readers HelperTM, which supports the reading of electronic and paper documents and produces arelevance score for each of the readers topics of interest, thereby helping the reader decide whether the document is actually worthskimming or reading.
Elastic Documents: Coupling Text and Tables through Contextual Visualizations for Enhanced Document Reading
TLDR
This paper parse the text content and data tables, cross-link the components using a keyword-based matching algorithm, and generate on-demand visualizations based on the reader's current focus within a document that couples text content with data tables contained in the document.
Mining Scientific Terms and their Definitions: A Study of the ACL Anthology
TLDR
DefMiner is presented, a supervised sequence labeling system that identifies scientific terms and their accompanying definitions and achieves 85% F1 on a Wikipedia benchmark corpus, significantly improving the previous state-of-the-art by 8%.
Stencils-based tutorials: design and evaluation
TLDR
Stencils, an interaction technique for presenting tutorials that uses translucent colored stencils containing holes that direct the user's attention to the correct interface component and prevent the user from interacting with other components, is introduced.
Hypertext: An Introduction and Survey
TLDR
A survey of existing hypertext systems, their applications, and their design is both an introduction to the world of hypertext and a survey of some of the most important design issues that go into fashioning a hypertext environment.
...
1
2
3
4
5
...