TYPiMatch: type-specific unsupervised learning of keys and key values for heterogeneous web data integration

@inproceedings{Ma2013TYPiMatchTU,
  title={TYPiMatch: type-specific unsupervised learning of keys and key values for heterogeneous web data integration},
  author={Yongtao Ma and Thanh Tran},
  booktitle={WSDM},
  year={2013}
}
Instance matching and blocking (a preprocessing step used to select candidate matches) both require determining the most representative attributes of instances, called keys, from which similarities between instances are computed. We show that, for the problem of learning blocking keys and key values, neither generic techniques that do not exploit type information nor supervised learning techniques optimized for a single predefined type of instances perform well on heterogeneous Web data…
