Improving the Quality of Linked Data Using Statistical Distributions

  title={Improving the Quality of Linked Data Using Statistical Distributions},
  author={Heiko Paulheim and Christian Bizer},
  journal={Int. J. Semantic Web Inf. Syst.},
Linked Data on the Web is either created from structured data sources (such as relational databases), from semi-structured sources (such as Wikipedia), or from unstructured sources (such as text). In the latter two cases, the generated Linked Data will likely be noisy and incomplete. In this paper, we present two algorithms that exploit statistical distributions of properties and types for enhancing the quality of incomplete and noisy Linked Data sets: SDType adds missing type statements, and… CONTINUE READING

6 Figures & Tables



Citations per Year

66 Citations

Semantic Scholar estimates that this publication has 66 citations based on the available data.

See our FAQ for additional information.