XSEED: Accurate and Fast Cardinality Estimation for XPath Queries

Abstract

We propose XSEED, a synopsis of path queries for cardinality estimation that is accurate, robust, efficient, and adaptive to memory budgets. XSEED starts from a very small kernel, and then incrementally updates information of the synopsis. With such an incremental construction, a synopsis structure can be dynamically configured to accommodate different memory budgets. Cardinality estimation based on XSEED can be performed very efficiently and accurately. Extensive experiments on both synthetic and real data sets show that even with less memory, XSEED could achieve accuracy that is an order of magnitude better than that of other synopsis structures. The cardinality estimation time is under 2% of the actual querying time for a wide range of queries in all test cases.

DOI: 10.1109/ICDE.2006.178

Extracted Key Phrases

9 Figures and Tables

Statistics

051015'05'06'07'08'09'10'11'12'13'14'15'16'17
Citations per Year

74 Citations

Semantic Scholar estimates that this publication has 74 citations based on the available data.

See our FAQ for additional information.

Cite this paper

@article{Zhang2006XSEEDAA, title={XSEED: Accurate and Fast Cardinality Estimation for XPath Queries}, author={Ning Zhang and M. Tamer {\"{O}zsu and Ashraf Aboulnaga and Ihab F. Ilyas}, journal={22nd International Conference on Data Engineering (ICDE'06)}, year={2006}, pages={61-61} }