Learning-Based Approaches for Matching Web Data Entities

  title={Learning-Based Approaches for Matching Web Data Entities},
  author={Hanna K{\"o}pcke and Andreas Thor and Erhard Rahm},
  journal={IEEE Internet Computing},
Entity matching is a key task for data integration and especially challenging for Web data. Effective entity matching typically requires combining several match techniques and finding suitable configuration parameters, such as similarity thresholds. The authors investigate to what degree machine learning helps semi-automatically determine suitable match strategies with a limited amount of manual training effort. They use a new framework, Fever, to evaluate several learning-based approaches for… CONTINUE READING
Highly Cited
This paper has 42 citations. REVIEW CITATIONS

From This Paper

Topics from this paper.


Publications citing this paper.
Showing 1-10 of 24 extracted citations

Revenue maximizing itemset construction for online shopping services

Industrial Management and Data Systems • 2013
View 4 Excerpts
Highly Influenced


Publications referenced by this paper.
Showing 1-10 of 14 references

Duplicate Record Detection: A Survey

IEEE Transactions on Knowledge and Data Engineering • 2007
View 2 Excerpts

Exampledriven design of efficient record machting

S. Chaudhuri, Chen, B.-C, V. Ganti, R. Kaushik
View 2 Excerpts

Data Quality: Concepts, Methodologies and Techniques

Data-Centric Systems and Applications • 2006
View 1 Excerpt

Rapid Prototyping for Complex Data Mining Tasks

I Mierswa
ACM SIGKDD, • 2006
View 1 Excerpt

Similar Papers

Loading similar papers…