Joint deduplication of multiple record types in relational data

@inproceedings{Culotta2005JointDO,
  title={Joint deduplication of multiple record types in relational data},
  author={Aron Culotta and Andrew McCallum},
  booktitle={CIKM},
  year={2005}
}
Record deduplication is the task of merging database records that refer to the same underlying entity. In relational databases, accurate deduplication for records of one type is often dependent on the decisions made for records of other types. Whereas nearly all previous approaches have merged records of different types independently, this work models these inter-dependencies explicitly to collectively deduplicate records of multiple types. We construct a conditional random field model of…
This paper has 89 citations.