An Introduction to Duplicate Detection

@inproceedings{Naumann2010AnIT,
  title={An Introduction to Duplicate Detection},
  author={Felix Naumann and Melanie Herschel},
  booktitle={An Introduction to Duplicate Detection},
  year={2010}
}
With the ever increasing volume of data, data quality problems abound. Multiple, yet different representations of the same real-world objects in data, duplicates, are one of the most intriguing data quality problems. The effects of such duplicates are detrimental; for instance, bank customers can obtain duplicate identities, inventory levels are monitored incorrectly, catalogs are mailed multiple times to the same household, etc. Automatically detecting duplicates is difficult: First, duplicate… CONTINUE READING
Highly Influential
This paper has highly influenced 10 other papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 226 citations. REVIEW CITATIONS

Citations

Publications citing this paper.
Showing 1-10 of 125 extracted citations

226 Citations

02040'11'13'15'17
Citations per Year
Semantic Scholar estimates that this publication has 226 citations based on the available data.

See our FAQ for additional information.

Similar Papers

Loading similar papers…