Corpus ID: 16285667

Looking Beyond Text: Extracting Figures, Tables and Captions from Computer Science Papers

@inproceedings{Clark2015LookingBT,
  title={Looking Beyond Text: Extracting Figures, Tables and Captions from Computer Science Papers},
  author={Christopher Clark and S. Divvala},
  booktitle={AAAI Workshop: Scholarly Big Data},
  year={2015}
}
Identifying and extracting figures and tables along with their captions from scholarly articles is important both as a way of providing tools for article summarization, and as part of larger systems that seek to gain deeper, semantic understanding of these articles. [...] Key Method This method can extract a wide variety of figures because it does not make strong assumptions about the format of the figures embedded in the document, as long as they can be differentiated from the main article's text.Expand
PDFFigures 2.0: Mining figures from research papers
Figure and caption extraction from biomedical documents
Scalable algorithms for scholarly figure mining and semantics
Line-items and table understanding in structured documents
Prediction of importance of figures in scholarly papers
  • Yui Kita, J. Rekimoto
  • Computer Science
  • 2017 Twelfth International Conference on Digital Information Management (ICDIM)
  • 2017
Table Understanding in Structured Documents
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 15 REFERENCES
Figure Metadata Extraction from Digital Documents
An Automatic System for Extracting Figures and Captions in Biomedical PDF Documents
On methods and tools of table detection, extraction and annotation in PDF documents
Automatic Extraction of Figures from Scientific Publications in High-Energy Physics
CiteSeerX: AI in a Digital Library Search Engine
Yale Image Finder (YIF): a new search engine for retrieving biomedical images
A survey of table recognition
Large Graph Construction for Scalable Semi-Supervised Learning
The Pascal Visual Object Classes (VOC) Challenge
...
1
2
...