Query sampling in DB2 Universal Database

@inproceedings{Gryz2004QuerySI,
  title={Query sampling in DB2 Universal Database},
  author={J. Gryz and Junjie Guo and L. Liu and C. Zuzarte},
  booktitle={SIGMOD '04},
  year={2004}
}
  • J. Gryz, Junjie Guo, L. Liu, C. Zuzarte
  • Published in SIGMOD '04, 2004
  • Computer Science
  • Executing ad hoc queries against large databases can be prohibitively expensive. Exploratory analysis of data may not require exact answers to queries, however: results based on sampling the data are often satisfactory. Supporting sampling as a primitive SQL operator turns out to be difficult because sampling does not commute with many SQL operators. In this paper, we describe an implementation in IBM® DB2® Universal Database (UDB) of a sampling operator that commutes with some SQL operators. As…
    12 Citations

    Fast approximate computation of statistics on views
    • 1
    Sampling algorithms for evolving datasets
    • 15
    A Sampling Algebra for Aggregate Estimation
    • 29
    Histograms for OLAP and Data-Stream Queries
    • F. Buccafurri
    • Computer Science
    • Encyclopedia of Data Warehousing and Mining
    • 2009
    • 3
    MISS: Finding Optimal Sample Sizes for Approximate Analytics
    Sampling dirty data for matching attributes
    • 26
    Interactive Visualization of Large Data Sets
    • 52
    Duplicate based schema matching
    • 2
    Hierarchisches gruppenbasiertes Sampling

    References

    Starburst Mid-Flight: As the Dust Clears
    • 284