Some entities are more equal than others: statistical methods to consolidate Linked Data

Abstract

We propose a method for consolidating entities in RDF data on the Web. Our approach is based on a statistical analysis of the use of predicates and their associated values to identify “quasi”-key properties. Compared to a purely symbolic based approach, we obtain promising results, retrieving more identical entities with a high precision. We also argue that our technique scales well—possibly to the size of the current Web of Data—as opposed to more expensive existing approaches.

Extracted Key Phrases

4 Figures and Tables

Cite this paper

@inproceedings{Hogan2010SomeEA, title={Some entities are more equal than others: statistical methods to consolidate Linked Data}, author={Aidan Hogan and Axel Polleres and J{\"{u}rgen Umbrich and Antoine Zimmermann}, year={2010} }