Entity resolution with iterative blocking

@inproceedings{Whang2009EntityRW,
  title={Entity resolution with iterative blocking},
  author={Steven Euijong Whang and David Menestrina and Georgia Koutrika and Martin Theobald and Hector Garcia-Molina},
  booktitle={SIGMOD Conference},
  year={2009}
}
Entity Resolution (ER) is the problem of identifying which records in a database refer to the same real-world entity. An exhaustive ER process involves computing the similarities between pairs of records, which can be very expensive for large datasets. Various blocking techniques can be used to enhance the performance of ER by dividing the records into blocks in multiple ways and only comparing records within the same block. However, most blocking techniques process blocks separately and do not… CONTINUE READING
Highly Influential
This paper has highly influenced 13 other papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 229 citations. REVIEW CITATIONS

From This Paper

Topics from this paper.

Citations

Publications citing this paper.

230 Citations

02040'10'12'14'16'18
Citations per Year
Semantic Scholar estimates that this publication has 230 citations based on the available data.

See our FAQ for additional information.

References

Publications referenced by this paper.

Bigmatch: A program for extracting probable matches from a large file for record linkage

  • W. Yancey
  • US Bureau of the Census, Tech. Rep., 2002. 12
  • 2002
Highly Influential
3 Excerpts

Similar Papers

Loading similar papers…