Impact of Crowdsourcing OCR Improvements on Retrievability Bias

Abstract

Digitized document collections often suffer from OCR errors that may impact a document's readability and retrievability. We studied the effects of correcting OCR errors on the retrievability of documents in a historic newspaper corpus of a digital library. We computed retrievability scores for the uncorrected documents using queries from the library's…
DOI: 10.1145/3197026.3197046
