A frequency-based approach for mining coverage statistics in data integration

@article{Nie2004AFA,
  title={A frequency-based approach for mining coverage statistics in data integration},
  author={Zaiqing Nie and Subbarao Kambhampati},
  journal={Proceedings. 20th International Conference on Data Engineering},
  year={2004},
  pages={387--398}
}
Query optimization in data integration requires source coverage and overlap statistics. Gathering and storing the required statistics presents many challenges, not the least of which is controlling the amount of statistics learned. We introduce StatMiner, a novel statistics mining approach which automatically generates attribute value hierarchies, efficiently discovers frequently accessed query classes based on the learned attribute value hierarchies, and learns statistics only with respect to…
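
To make the two statistics named in the abstract concrete, here is a minimal sketch. It assumes that coverage means the fraction of a query's answer tuples that a source returns, and that overlap means the fraction returned by both of two sources. The function names, source contents, and tuple identifiers are hypothetical and are not taken from the paper; this is not StatMiner itself.

```python
def coverage(source_tuples, all_answers):
    """Fraction of the query's answer tuples that one source returns (assumed definition)."""
    if not all_answers:
        return 0.0
    return len(source_tuples & all_answers) / len(all_answers)


def overlap(source_a, source_b, all_answers):
    """Fraction of the query's answer tuples returned by both sources (assumed definition)."""
    if not all_answers:
        return 0.0
    return len(source_a & source_b & all_answers) / len(all_answers)


# Hypothetical answer set for one query, and what two hypothetical sources return.
answers = {"p1", "p2", "p3", "p4"}
s1 = {"p1", "p2", "p3"}
s2 = {"p3", "p4"}

cov_s1 = coverage(s1, answers)      # 3 of 4 answers
cov_s2 = coverage(s2, answers)      # 2 of 4 answers
ovl_s1_s2 = overlap(s1, s2, answers)  # only p3 is shared
```

Under these assumptions, a mediator that has already called `s1` gains only one new tuple from `s2`, which is the kind of decision such statistics are meant to inform.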
