Grammar-Based Recognition of Documentary Forms and Extraction of Metadata

Abstract

Metadata extraction is a critical aspect of ingestion of collections into digital archives and libraries. A method for automatically recognizing document types and extracting metadata from digital records has been developed. The method is based on a method for automatically annotating semantic categories such as person’s names, job titles, dates, and postal… (More)
DOI: 10.2218/ijdc.v5i1.149

Topics

7 Figures and Tables

Cite this paper

@article{Underwood2010GrammarBasedRO, title={Grammar-Based Recognition of Documentary Forms and Extraction of Metadata}, author={William Underwood}, journal={IJDC}, year={2010}, volume={5}, pages={148-159} }