• Publications
  • Influence
Linked data quality of DBpedia, Freebase, OpenCyc, Wikidata, and YAGO
TLDR
Data quality criteria according to which KGs can be analyzed and analyze and compare the above mentioned KGs are provided and a framework for finding the most suitable KG for a given setting is proposed.
The Microsoft Academic Knowledge Graph: A Linked Data Source with 8 Billion Triples of Scholarly Data
TLDR
The Microsoft Academic Knowledge Graph (MAKG), a large RDF data set with over eight billion triples with information about scientific publications and related entities, such as authors, institutions, journals, and fields of study, is presented.
A Comparative Survey of DBpedia , Freebase , OpenCyc , Wikidata , and YAGO
TLDR
This survey defines aspects according to which KGs can be analyzed and compares and analyzes and proposes a method for finding the most suitable KG for a given setting.
Annotating and Analyzing Biased Sentences in News Articles using Crowdsourcing
TLDR
A novel news bias dataset is proposed which facilitates the development and evaluation of approaches for detecting subtle bias in news articles and for understanding the characteristics of biased sentences and can serve as resource for related researches including ones focusing on fake news detection.
To Cite, or Not to Cite? Detecting Citation Contexts in Text
TLDR
A method based on a convolutional recurrent neural network for classifying potential citation contexts and determining the “cite-worthiness” for each potential citation context as a step before the actual citation recommendation is beneficial.
Citation recommendation: approaches and datasets
TLDR
This article presents an overview of the approaches and data sets for citation recommendation and identifies differences and commonalities using various dimensions, and sheds light on the evaluation methods and outline general challenges in the evaluation and how to meet them.
Which Knowledge Graph Is Best for Me?
TLDR
This paper intends to ease the access to the in-depth survey by presenting simplified rules that map individual data quality requirements to specific knowledge graphs, but does not intend to replace the previously introduced decision-support framework.
PRoST: Distributed Execution of SPARQL Queries Using Mixed Partitioning Strategies
TLDR
A new RDF store called PRoST (Partitioned RDF on Spark Tables) based on Apache Spark is presented, an innovative strategy that combines the Vertical Partitioning approach with the Property Table, two preexisting models for storing RDF datasets.
Bibliometric-Enhanced arXiv: A Data Set for Paper-Based and Citation-Based Tasks
TLDR
A new data set based on all publications from all scientific fields available on arXiv.org is proposed, which can enhance the future evaluation of research paper-based and citation context-based approaches but also serve as a basis for novel ideas to analyze papers.
...
1
2
3
4
5
...