Automatic record linkage using seeded nearest neighbour and support vector machine classification

@inproceedings{Christen2008AutomaticRL,
  title={Automatic record linkage using seeded nearest neighbour and support vector machine classification},
  author={Peter Christen},
  booktitle={KDD},
  year={2008}
}
The task of linking databases is an important step in an increasing number of data mining projects, because linked data can contain information that is not available otherwise, or that would require time-consuming and expensive collection of specific data. The aim of linking is to match and aggregate all records that refer to the same entity. One of the major challenges when linking large databases is the efficient and accurate classification of record pairs into matches and non-matches. While…