Detecting figures and part labels in patents: competition-based development of graphics recognition algorithms

@article{Riedl2016DetectingFA,
  title={Detecting figures and part labels in patents: competition-based development of graphics recognition algorithms},
  author={Christoph Riedl and R. Zanibbi and Marti A. Hearst and Siyu Zhu and Michael Menietti and Jason Crusan and Ivan Metelsky and K. Lakhani},
  journal={International Journal on Document Analysis and Recognition (IJDAR)},
  year={2016},
  volume={19},
  pages={155-172}
}
Most United States Patent and Trademark Office (USPTO) patent documents contain drawing pages which describe inventions graphically. By convention and by rule, these drawings contain figures and parts that are annotated with numbered labels but not with text. As a result, readers must scan the document to find the description of a given part label. To make progress toward automatic creation of ‘tool-tips’ and hyperlinks from part labels to their associated descriptions, the USPTO hosted a… Expand
Text Detection in Natural Scenes and Technical Diagrams with Convolutional Feature Learning and Cascaded Classification
TLDR
A text detection system to analyze and utilize visual information in a data driven, automatic and intelligent way, including patch-based coarse-to-fine detection (Text-Conv), connected component extraction using region growing, and graph-based word segmentation (Word-Graph). Expand
Novel mixed-encoding for forecasting patent grant duration
TLDR
This work builds upon the existing study on patent grant duration prediction by devising a novel methodology of encoding the data using a combination of augmented one-hot encoding and label-encoding, and identifies some of the important factors which impact the decision on grant duration of patent applications using the raw data from the Indian Patent Office. Expand
Diagram Image Retrieval and Analysis: Challenges and Opportunities
TLDR
This paper investigates recent research on diagram image retrieval and analysis, with an emphasis on methods using content-based image retrieval (CBIR), textures, shapes, topology and geometry, and point out future research opportunities from technical and application perspectives. Expand
Digital Art Feature Association Mining Based on the Machine Learning Algorithm
  • Zhiying Wu, Yuan Chen
  • Computer Science
  • Complex.
  • 2021
With the development of computer hardware and software, digital art is a new discipline. It uses computers and digital technology as tools to perform artistic expression. It can be expanded toExpand
Machine Learning Approaches to Facial and Text Analysis: Discovering CEO Oral Communication Styles
We demonstrate how a novel synthesis of three methods — (1) unsupervised topic modeling of text data to generate new measures of textual variance, (2) sentiment analysis of text data, and (3)Expand
The state-of-the-art on Intellectual Property Analytics (IPA): A literature review on artificial intelligence, machine learning and deep learning methods for analysing intellectual property (IP) data
TLDR
This literature review presents the state of the art in intellectual property analytics by reviewing 57 recent articles and follows a narrative approach with search strategy, where the state-of-the-art is presented. Expand
Mining BIG Data : The Future of Exploration Targeting Using Machine Learning
  • 2017
Exploration expenditures will increasingly be shifting to deeper domains and blind targets as the “easy discoveries” are progressively exhausted. The mining industry’s discovery rate has fallenExpand
Protection through participation: Crowdsourced tap water quality monitoring for enhanced public health.
TLDR
This project helps inform future public engagement with water quality monitoring, create new knowledge about the influence of personal motivations for participation, and provide recommendations to help increase awareness of water quality issues. Expand

References

SHOWING 1-10 OF 98 REFERENCES
Image search in patents: a review
  • Naeem Bhatti, A. Hanbury
  • Computer Science
  • International Journal on Document Analysis and Recognition (IJDAR)
  • 2012
TLDR
The importance, requirements, and challenges of a patent image retrieval system are introduced and an overview of the algorithms developed for the retrieval and analysis of CAD and technical drawings, diagrams, data flow diagrams, circuit diagrams,Data charts, flowcharts, plots, and symbol recognition are presented. Expand
Text line extraction in graphical documents using background and foreground information
  • P. Roy, U. Pal, J. Lladós
  • Computer Science
  • International Journal on Document Analysis and Recognition (IJDAR)
  • 2011
TLDR
A novel method to segment such text lines and the method is based on the foreground and background information of the text components, using a water reservoir concept to effectively utilize the background information. Expand
Evaluating structural pattern recognition for handwritten math via primitive label graphs
TLDR
This work defines new metrics obtained by Hamming distances over label graphs, which allow classification, segmentation and parsing errors to be characterized separately, or using a single measure, at the primitive level. Expand
Whole-Book Recognition
  • Pingping Xiu, H. Baird
  • Computer Science, Medicine
  • IEEE Transactions on Pattern Analysis and Machine Intelligence
  • 2012
TLDR
An algorithm which expects to be initialized with approximate iconic and linguistic models-derived from OCR results and dictionaries-and then, guided entirely by evidence internal to the test set, corrects the models which, in turn, yields higher recognition accuracy. Expand
Patent Retrieval
TLDR
This survey of work done on patent data in relation to Information Retrieval in the last 20–25 years is a survey of the sources of difficulty and the existing document processing and retrieval methods of the domain, and provides a motivation for further research in the area. Expand
DMOS, a generic document recognition method: application to table structure analysis in a general and in a specific way
  • Bertrand Coüasnon
  • Computer Science
  • International Journal of Document Analysis and Recognition (IJDAR)
  • 2005
TLDR
The Description and Modification of Segmentation (DMOS) method is proposed, which is made of a new grammatical language (Enhanced Position Formalism—EPF) and an associated parser able to deal with noise, which has been successfully used to produce recognition systems on musical scores, mathematical formulae and even tennis courts in videos. Expand
Flowchart recognition for non-textual information retrieval in patent search
TLDR
This paper presents a flowchart recognition method aimed at achieving a structured representation of flowchart images that can be further queried semantically, and reports the obtained results on this dataset. Expand
Detection of Dimension Sets in Engineering Drawings
TLDR
A new rule-based text/graphics separation algorithm and a model-based procedure for detecting arrowheads in any orientation have been developed for detecting dimension sets in engineering drawings drawn to ANSI drafting standards. Expand
A Survey of Methods and Strategies in Character Segmentation
TLDR
H holistic approaches that avoid segmentation by recognizing entire character strings as units are described, including methods that partition the input image into subimages, which are then classified. Expand
Machine printed text and handwriting identification in noisy document images
TLDR
This paper addresses the problem of the identification of text in noisy document images by treating noise as a separate class and model noise based on selected features. Expand
...
1
2
3
4
5
...