This paper presents a methodology for automatically identifying bibliographic data elements from the title pages of books. It enumerates the steps involved: scanning the title pages, running Optical Character Recognition (OCR) software, generating HTML files from the title pages, and applying heuristics to identify the bibliographic data elements. Much...
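The final step, applying heuristics to the generated HTML, can be illustrated with a minimal sketch. The abstract does not state the paper's actual heuristics or HTML layout, so everything here is an assumption made for illustration: the OCR-derived HTML is assumed to place each text line in a `<p>` element with an inline `font-size` in points, and the three rules (largest font is the title, a line starting with "by" names the author, a four-digit number is the year) are hypothetical stand-ins. The names `LineCollector` and `identify_elements` are likewise invented for this example.

```python
import re
from html.parser import HTMLParser


class LineCollector(HTMLParser):
    """Collect (font_size_pt, text) pairs from <p style="font-size:Npt"> elements.

    The input layout is an assumption; real OCR-to-HTML output may differ.
    """

    def __init__(self):
        super().__init__()
        self.lines = []    # list of (font size in pt, text)
        self._size = None  # font size of the <p> currently open, if any
        self._buf = []

    def handle_starttag(self, tag, attrs):
        if tag == "p":
            style = dict(attrs).get("style") or ""
            m = re.search(r"font-size:\s*(\d+(?:\.\d+)?)pt", style)
            self._size = float(m.group(1)) if m else 0.0
            self._buf = []

    def handle_data(self, data):
        if self._size is not None:
            self._buf.append(data)

    def handle_endtag(self, tag):
        if tag == "p" and self._size is not None:
            # Collapse runs of whitespace left over from OCR line breaks.
            text = " ".join("".join(self._buf).split())
            if text:
                self.lines.append((self._size, text))
            self._size = None


def identify_elements(html):
    """Apply three hypothetical heuristics to a title page given as HTML."""
    collector = LineCollector()
    collector.feed(html)
    collector.close()
    lines = collector.lines

    result = {"title": None, "author": None, "year": None}
    if not lines:
        return result

    # Assumed heuristic 1: the line in the largest font is the title
    # (the first such line wins on ties).
    title_idx = max(range(len(lines)), key=lambda i: lines[i][0])
    result["title"] = lines[title_idx][1]

    for i, (_, text) in enumerate(lines):
        if i == title_idx:
            continue
        # Assumed heuristic 2: a line beginning with "by" names the author.
        by = re.match(r"(?i)by\s+(.+)", text)
        if by and result["author"] is None:
            result["author"] = by.group(1)
        # Assumed heuristic 3: a four-digit number from 1500 to 2099 is the year.
        year = re.search(r"\b(1[5-9]\d{2}|20\d{2})\b", text)
        if year and result["year"] is None:
            result["year"] = year.group(1)
    return result
```

Calling `identify_elements` on a title page rendered in this assumed layout returns a dictionary with `title`, `author`, and `year` keys, each left as `None` when no rule matches. Rules this simple would misfire on many real title pages (subtitles in large type, multiple authors, series statements), which is presumably why the methodology relies on a fuller set of heuristics.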