Adaptive Windows for Duplicate Detection

Uwe Draisbach, Felix Naumann, Sascha Szott, Oliver Wonneberg
2012 IEEE 28th International Conference on Data Engineering
Duplicate detection is the task of identifying all groups of records within a data set that each represent the same real-world entity. This task is difficult because (i) representations might differ slightly, so some similarity measure must be defined to compare pairs of records, and (ii) data sets might be large, making a pairwise comparison of all records infeasible. To tackle the second problem, many algorithms have been suggested that partition the data set and compare…
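The two difficulties named in the abstract can be illustrated with a small sketch. It is not the paper's adaptive method: it contrasts an all-pairs comparison with a simple fixed-size sliding window over sorted records, which is assumed here only as a stand-in for the partitioning idea. The function names (`similar`, `all_pairs`, `fixed_window`), the `difflib`-based similarity measure, the 0.8 threshold, and the sample records are all hypothetical choices made for illustration.

```python
from difflib import SequenceMatcher
from itertools import combinations


def similar(a: str, b: str, threshold: float = 0.8) -> bool:
    """Hypothetical similarity measure: difflib's character-level ratio."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio() >= threshold


def all_pairs(records):
    """Compare every pair of records: n * (n - 1) / 2 comparisons."""
    pairs = list(combinations(range(len(records)), 2))
    matches = {(i, j) for i, j in pairs if similar(records[i], records[j])}
    return matches, len(pairs)


def fixed_window(records, window=3):
    """Sort records by a key, then compare each record only with the
    next (window - 1) records in sorted order.

    Assumption: a fixed window size, unlike the adaptive windows the
    paper's title refers to.
    """
    order = sorted(range(len(records)), key=lambda i: records[i].lower())
    matches, comparisons = set(), 0
    for pos, i in enumerate(order):
        for j in order[pos + 1 : pos + window]:
            comparisons += 1
            if similar(records[i], records[j]):
                matches.add((min(i, j), max(i, j)))
    return matches, comparisons


# Hypothetical sample data, not taken from the paper.
records = [
    "John Smith",
    "Jon Smith",
    "Mary Jones",
    "Marie Jones",
    "Alan Turing",
    "John Smyth",
]

exhaustive_matches, exhaustive_count = all_pairs(records)
window_matches, window_count = fixed_window(records, window=3)
print(exhaustive_count, window_count)
```

For these six records the all-pairs version performs 15 comparisons and the window version 9. The gap grows quickly with data set size, since the first count is quadratic in the number of records while the second is linear for a fixed window. The trade-off is that a window can only find duplicates that land close together after sorting.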




Publications referenced by this paper.

Adaptive windows for duplicate detection

  • U. Draisbach, F. Naumann, S. Szott, O. Wonneberg
  • Hasso-Plattner-Institut für Softwaresystemtechnik…
  • 2011

Industry-scale duplicate detection

  • M. Weis, F. Naumann, U. Jehle, J. Lufter, H. Schuster
  • Proceedings of the VLDB Endowment, vol. 1, no. 2…
  • 2008
