Intelligent document processing

@article{Esposito2005IntelligentDP,
  title={Intelligent document processing},
  author={Floriana Esposito and Stefano Ferilli and Teresa Maria Altomare Basile and Nicola Di Mauro},
  journal={Eighth International Conference on Document Analysis and Recognition (ICDAR'05)},
  year={2005},
  pages={1100-1104 Vol. 2}
}
Digital repositories raise the need for an effective and efficient retrieval of the stored material. In this paper, we propose the intensive application of intelligent techniques to the steps of document layout analysis, document image classification and understanding on digital documents. Specifically, the complex interrelation existing among layout components, that are fundamental to assign them the proper semantic role, suggest the exploitation of first-order representations in some learning… 

Figures from this paper

XML-based intelligent document technology and its development
TLDR
The article proposes a Web Service-based architecture of the intelligent document, analyzes the method of describing the cross-platform dynamic operation behavior, proposes a method to describe the dynamic operation and its operation interface, and discusses the development of the Intelligent document processing technology.
A Regeneration Based Lines Restoration Method
TLDR
This article proposes a novel regeneration based method for eliminating degradations of lines in binary document images and engineering drawings using reformed chain codes expression to detect degraded lines in images.
ePhilology: when the books talk to their readers 1
Writing, Phaedrus, has this strange quality, and is very like painting; for the creatures of painting stand like living beings, but if one asks them a question, they preserve a solemn silence. And so

References

SHOWING 1-10 OF 10 REFERENCES
Machine Learning for Intelligent Processing of Printed Documents
TLDR
This article proposes the application of machine learning techniques to acquire the specific knowledge required by an intelligent document processing system, named WISDOM++, that manages printed documents, such as letters and journals.
Incremental multistrategy learning for document processing
TLDR
This work presents the application of a multistrategy approach to some document processing tasks in an enhanced version of the incremental learning system INTHELEX, embedded in the system architecture of the EU project COLLATE.
Automated Labeling Algorithms for Biomedical Document Images
TLDR
A labeling module in the MARS, which automatically extract the bibliographic data in biomedical journal articles, is described, which shows relatively accurate labeling results.
Two Geometric Algorithms for Layout Analysis
  • T. Breuel
  • Computer Science
    Document Analysis Systems
  • 2002
This paper presents geometric algorithms for solving two key problems in layout analysis: finding a cover of the background whitespace of a document in terms of maximal empty rectangles, and finding
Spatial Relations, Minimum Bounding Rectangles, and Spatial Data Structures
TLDR
This paper describes topological and direction relations between region objects and study the spatial information that Minimum Bounding Rectangles convey about the actual objects they enclose, and applies the results in R-trees and their variations in order to minimize the number of disk accesses for queries involving topologicaland direction relations.
Twenty Years of Document Image Analysis in PAMI
  • G. Nagy
  • Computer Science
    IEEE Trans. Pattern Anal. Mach. Intell.
  • 2000
The contributions to document image analysis of 99 papers published in the IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI) are clustered, summarized, interpolated, interpreted,
Solving the Multiple Instance Problem with Axis-Parallel Rectangles
Reasoning about Binary Topological Relations
TLDR
A new formalism is presented to reason about topological relations based upon the nine intersections of boundaries, interiors, and complements between two objects that is applicable as a foundation for an algebra over topological Relations.
Ground truth data for document image analysis Proc. of SDIUT'03
  • Ground truth data for document image analysis Proc. of SDIUT'03
  • 2003
Ground truth data for document image analysis Proc
  • of SDIUT'03, pp. 199-205,
  • 2003