Creating a Scholarly Knowledge Graph from Survey Article Tables

  title={Creating a Scholarly Knowledge Graph from Survey Article Tables},
  author={Allard Oelen and Markus Stocker and S. Auer},
Due to the lack of structure, scholarly knowledge remains hardly accessible for machines. Scholarly knowledge graphs have been proposed as a solution. Creating such a knowledge graph requires manual effort and domain experts, and is therefore time-consuming and cumbersome. In this work, we present a human-in-the-loop methodology used to build a scholarly knowledge graph leveraging literature survey articles. Survey articles often contain manually curated and high-quality tabular information… 
Improving Access to Scientific Literature with Knowledge Graphs
A scholarly knowledge graph can be used to give a condensed overview on the state-of-the-art addressing a particular research quest, for example as a tabular comparison of contributions according to various characteristics of the approaches.
Leveraging a Federation of Knowledge Graphs to Improve Faceted Search in Digital Libraries
A methodology for improving a faceted search system on structured content by leveraging a federation of scholarly knowledge graphs and using dynamic facets on diverse data types, meaning that facets can change according to the user query.
SmartReviews: Towards Human- and Machine-actionable Representation of Review Articles
SmartReviews, a review authoring and publishing tool, specifically addressing the limitations of review articles is presented, which enables community-based authoring of living articles, leveraging a scholarly knowledge graph to provide machineactionable knowledge.
Digital Libraries at Times of Massive Societal Transition: 22nd International Conference on Asia-Pacific Digital Libraries, ICADL 2020, Kyoto, Japan, November 30 – December 1, 2020, Proceedings
This study aims to offer recommendations to the stakeholders of digital libraries for selecting the appropriate technique to build knowledge-graph-based systems for enhanced scholarly information organization by presenting a thorough empirical evaluation of eight Bert-based classification models.


Towards a Knowledge Graph Representing Research Findings by Semantifying Survey Articles
This article describes how surveys for research fields can be represented in a semantic way, resulting in a knowledge graph that describes the individual research problems, approaches, implementations and evaluations in a structured and comparable way and demonstrates the utility of the resulting knowledge graph.
Comparing Research Contributions in a Scholarly Knowledge Graph
This paper presents a workflow and system designed to compare research contributions in a scientific knowledge graph, implemented in the Open Research Knowledge Graph (ORKG), which enables researchers to find and compare related literature.
An Analysis of the Microsoft Academic Graph
The results show that the citation data and publication metadata correlate well with external datasets, and the MAG has very good coverage across different domains with a slight bias towards technical disciplines.
Open Research Knowledge Graph: Next Generation Infrastructure for Semantic Scholarly Knowledge
The first steps towards a knowledge graph based infrastructure that acquires scholarly knowledge in machine actionable form are presented thus enabling new possibilities for scholarly knowledge curation, publication and processing.
Crowdsourced semantic annotation of scientific publications and tabular data in PDF
The SemAnn approach enables collaborative annotation of text and tables in PDF documents, a format that is still the common denominator of publishing, thus maximising the potential user base and confirming the merit of the low-threshold annotation support for the crowd.
Unsupervised document structure analysis of digital scientific articles
This work has developed a processing pipeline that analyses the structure a PDF document using a number of unsupervised machine learning techniques and heuristics and shows that it outperforms a state-of-the-art system in terms of the quality of the extracted body text and table of contents.
A Large Public Corpus of Web Tables containing Time and Context Metadata
A large public corpus of Web tables which contains over 233 million tables and has been extracted from the July 2015 version of the CommonCrawl is presented to provide a common ground for evaluating Web table systems.
TableSeer: automatic table metadata extraction and searching in digital libraries
Overall, TableSeer eliminates the burden of manually extract table data from digital libraries and enables users to automatically examine tables, and proposes an extensive set of medium-independent metadata for tables that scientists and other users can adopt for representing table information.
Unleashing Tabular Content to Open Data: A Survey on PDF Table Extraction Methods and Tools
This paper aims at providing a structured and comprehensive overview of the research in tabular content extraction specifically from PDF documents as well as to provide an overview of most recent practical results in the literature.
Evaluating Reference String Extraction Using Line-Based Conditional Random Fields: A Case Study with German Language Publications
This work proposes a classification model that considers every line in a publication as a potential part of a reference string by applying line-based conditional random fields rather than constructing the graphical model based on individual words, dependencies and patterns that are typical in reference sections.