Record Matching : Improving Performance in Classification Cyju

  title={Record Matching : Improving Performance in Classification Cyju},
  author={Elizabeth M Varghese and G. Naveen Sundar},
Duplication detection identifies the records that represent the same real-world entity. This is a vital process in data integration. Record matching refers to the task of finding entries that refer to the same entity in two or more files. Performing record matching solves the duplication detection problems; hence the needs for identifying the suitable record matching technique follow. Supervised methods are the current techniques used for duplication detection. This requires the user to provide… CONTINUE READING

From This Paper

Figures, tables, and topics from this paper.


Publications referenced by this paper.
Showing 1-10 of 18 references

Efficient Algorithm for Localized Support Vector Machine,

  • Haibin Cheng, Pang-Ning Tan, Member, IEEE, Rong Jin
  • IEEE Transaction Knowledge and Data Eng.,
  • 2010
1 Excerpt

Goiser, “Quality and Complexity Measures for Data Linkage and Deduplication,

  • K. P. Christen
  • Quality Measures in Data Mining,
  • 2007
1 Excerpt


  • W. Su
  • Wang, and F.H. Lochovsky, “Holistic Schema…
  • 2006
1 Excerpt


  • P. Christen
  • and M. Hegland, “Febrl—A Parallel Open Source…
  • 2004