On integrating catalogs

@inproceedings{Agrawal2001OnIC,
  title={On integrating catalogs},
  author={Rakesh Agrawal and Ramakrishnan Srikant},
  booktitle={WWW},
  year={2001}
}
We address the problem of integrating documents from different sources into a master catalog. This problem is pervasive in web marketplaces and portals. Current technology for automating this process consists of building a classifier that uses the categorization of documents in the master catalog to construct a model for predicting the category of unknown documents. Our key insight is that many of the data sources have their own categorization, and classification accuracy can be improved by… CONTINUE READING

Similar Papers

Citations

Publications citing this paper.
SHOWING 1-10 OF 93 CITATIONS

Web taxonomy integration through co-bootstrapping

VIEW 15 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

Web taxonomy integration using support vector machines

VIEW 15 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

TACI: Taxonomy-Aware Catalog Integration

  • IEEE Transactions on Knowledge and Data Engineering
  • 2013
VIEW 7 EXCERPTS
CITES METHODS, BACKGROUND & RESULTS
HIGHLY INFLUENCED

Integrating Community Question and Answer Archives

VIEW 5 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

Learning to integrate web taxonomies

  • J. Web Semant.
  • 2004
VIEW 10 EXCERPTS
CITES BACKGROUND & METHODS
HIGHLY INFLUENCED

A Clustering-based Approach for Supporting Document-Category Integration

VIEW 6 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

Cross-training: learning probabilistic mappings between topics

VIEW 7 EXCERPTS
CITES BACKGROUND, METHODS & RESULTS
HIGHLY INFLUENCED

FILTER CITATIONS BY YEAR

2002
2018

CITATION STATISTICS

  • 17 Highly Influenced Citations

References

Publications referenced by this paper.

Machine Learning

T. M. Mitchell
  • McGraw-Hill,
  • 1997
VIEW 4 EXCERPTS
HIGHLY INFLUENTIAL