• Corpus ID: 207788596

Information Extraction from Text Regions with Complex Tabular Structure

Kaixuan Zhang, Zejiang Shen, Jie Zhou, Melissa Dell
Recent innovations have improved layout analysis of document images, significantly strengthening our ability to identify text and non-text regions. However, extracting information from within text regions remains challenging because a text region may itself have a complex structure. In this paper, we present a new dataset with complex tabular structure and propose new methods to robustly retrieve information from such regions.


CNN-Based Page Segmentation and Object Classification for Counting Population in Ottoman Archival Documentation

This study developed an automatic system for counting registered individuals and assigning them to populated places using a CNN-based architecture, and achieved promising results both for classifying different types of objects and for counting individuals.

FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information Extraction

This work proposes FormNet, a structure-aware sequence model that mitigates the suboptimal serialization of forms. It designs Rich Attention, which leverages the spatial relationship between tokens in a form for more precise attention score calculation, and constructs Super-Tokens for each word.

Computerized Counting of Individuals in Ottoman Population Registers with Deep Learning

This study developed an automatic system for counting registered individuals and assigning them to populated places by using a CNN-based architecture and achieved promising results for classifying different types of objects.

Automatic Estimation of Age Distributions from the First Ottoman Empire Population Register Series by Using Deep Learning

This study applied deep learning-based layout analysis techniques to detect individuals in the first Ottoman population register series collected between the 1840s and the 1860s and trained a CNN model to recognize automatically detected ages of individuals and estimated age distributions of people from these historical documents.

Document Analysis Systems: 14th IAPR International Workshop, DAS 2020, Wuhan, China, July 26–29, 2020, Proceedings

Experiments on Chinese character recognition, Chinese text line recognition and fine-grained image classification achieve consistent improvement, proving that the regularization is beneficial to generalization and robustness of a recognition model.

Historical Natural Experiments: Bridging Economics and Economic History

The analysis of historical natural experiments has profoundly impacted economics research across fields. We trace the development and increasing application of the methodology, both from the



ICDAR2017 Competition on Recognition of Documents with Complex Layouts - RDCL2017

The results indicate that an innovative approach has a clear advantage but there is still a considerable need to develop robust methods that deal with layout challenges, especially with the non-textual content.

DeepLayout: A Semantic Segmentation Approach to Page Layout Analysis

This paper introduces a semantic segmentation approach: an end-to-end trainable deep neural network that takes only the document image as input and predicts per-pixel saliency maps. It also brings RLSA into the post-processing procedure to specify region boundaries.

dhSegment: A Generic Deep-Learning Approach for Document Segmentation

This paper proposes an open-source implementation of a CNN-based pixel-wise predictor coupled with task dependent post-processing blocks and shows that a single CNN-architecture can be used across tasks with competitive results.

An Introduction to Conditional Random Fields

This survey describes conditional random fields, a popular probabilistic method for structured prediction, and describes methods for inference and parameter estimation for CRFs, including practical issues for implementing large-scale CRFs.

MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications

This work introduces two simple global hyper-parameters that efficiently trade off between latency and accuracy, and demonstrates the effectiveness of MobileNets across a wide range of applications and use cases, including object detection, fine-grained classification, face attributes, and large-scale geo-localization.

Lecture 6.5—RmsProp: Divide the gradient by a running average of its recent magnitude

COURSERA: Neural Networks for Machine Learning, 2012