Generic entity resolution with negative rules

  title={Generic entity resolution with negative rules},
  author={Steven Euijong Whang and Omar Benjelloun and Hector Garcia-Molina},
  journal={The VLDB Journal},
Entity resolution (ER) (also known as deduplication or merge-purge) is a process of identifying records that refer to the same real-world entity and merging them together. In practice, ER results may contain “inconsistencies,” either due to mistakes by the match and merge function writers or changes in the application semantics. To remove the inconsistencies, we introduce “negative rules” that disallow inconsistencies in the ER solution (ER-N). A consistent solution is then derived based on the… CONTINUE READING
Highly Cited
This paper has 43 citations. REVIEW CITATIONS


Publications citing this paper.
Showing 1-10 of 26 extracted citations


Publications referenced by this paper.
Showing 1-10 of 32 references

D-Swoosh: A Family of Algorithms for Generic, Distributed Entity Resolution

27th International Conference on Distributed Computing Systems (ICDCS '07) • 2007

Duplicate Record Detection: A Survey

IEEE Transactions on Knowledge and Data Engineering • 2007
View 1 Excerpt

Overview of record linkage and current rese arch directions

W. Winkler
Tech. rep., Statistical Research Division, U.S. Bureau of the Census, Washington, DC • 2006
View 1 Excerpt

Additi onal experiments on negative rules

S. E. Whang, O. Benjelloun, H. Garcia-Molina
Tech. rep., Stanford Universi ty • 2005
View 1 Excerpt

Similar Papers

Loading similar papers…