PDFFigures 2.0: Mining figures from research papers

@article{Clark2016PDFFigures2M,
  title={PDFFigures 2.0: Mining figures from research papers},
  author={Christopher Andreas Clark and Santosh Kumar Divvala},
  journal={2016 IEEE/ACM Joint Conference on Digital Libraries (JCDL)},
  year={2016},
  pages={143--152}
}
Figures and tables are key sources of information in many scholarly documents. However, current academic search engines do not make use of figures and tables when semantically parsing documents or presenting document summaries to users. To facilitate these applications we develop “PDFFigures 2.0,” an algorithm that extracts figures, tables, and captions from documents. Our proposed approach analyzes the structure of individual pages by detecting captions, graphical elements, and chunks of…

Citations

Publications citing this paper.

Extracting Scientific Figures with Distantly Supervised Neural Networks

Data-Driven Recognition and Extraction of PDF Document Elements

SPaSe - Multi-Label Page Segmentation for Presentation Slides

DLPaper2Code: Auto-generation of Code from Deep Learning Research Papers

Mining Faces from Biomedical Literature using Deep Learning

CiteSeerX: 20 years of service to scholarly big data

