Domain-independent data cleaning via analysis of entity-relationship graph

Dmitri V. Kalashnikov and Sharad Mehrotra
ACM Trans. Database Syst.
In this article, we address the problem of reference disambiguation. Specifically, we consider a situation where entities in the database are referred to using descriptions (e.g., a set of instantiated attributes). The objective of reference disambiguation is to identify the unique entity to which each description corresponds. The key difference between the approach we propose (called RelDC) and the traditional techniques is that RelDC analyzes not only object features but also inter-object…
This paper has 193 citations.

Publications citing this paper.

Duplicate detection in XML data


Data and Information Quality

Data-Centric Systems and Applications • 2016

Data Matching

Data-Centric Systems and Applications • 2012

Scalable Iterative Graph Duplicate Detection

IEEE Transactions on Knowledge and Data Engineering • 2012

