Corpus ID: 3158119

LogLog-Beta and More: A New Algorithm for Cardinality Estimation Based on LogLog Counting

@article{Qin2016LogLogBetaAM,
  title={LogLog-Beta and More: A New Algorithm for Cardinality Estimation Based on LogLog Counting},
  author={J. Qin and Denys Kim and Yumei Tung},
  journal={ArXiv},
  year={2016},
  volume={abs/1612.02284}
}
  • J. Qin, Denys Kim, Yumei Tung
  • Published 2016
  • Computer Science, Mathematics
  • ArXiv
  • The information presented in this paper defines LogLog-Beta. LogLog-Beta is a new algorithm for estimating cardinalities based on LogLog counting. The new algorithm uses only one formula and needs no additional bias corrections for the entire range of cardinalities, therefore, it is more efficient and simpler to implement. Our simulations show that the accuracy provided by the new algorithm is as good as or better than the accuracy provided by either of HyperLogLog or HyperLogLog++. In addition… CONTINUE READING
    4 Citations

    Figures and Topics from this paper

    Explore Further: Topics Discussed in This Paper

    SetSketch: Filling the Gap between MinHash and HyperLogLog
    • PDF
    Estimating Edge-Local Triangle Count Heavy Hitters in Edge-Linear Time and Almost-Vertex-Linear Space
    • 3
    DegreeSketch: Distributed Cardinality Sketches on Massive Graphs with Applications
    • 1
    • Highly Influenced
    • PDF
    Game Theoretic Approaches to Cyber Security: Challenges, Results, and Open Problems
    • 2
    • PDF

    References

    SHOWING 1-10 OF 18 REFERENCES
    HyperLogLog: the analysis of a near-optimal cardinality estimation algorithm
    • 452
    • Highly Influential
    • PDF
    Loglog counting of large cardinalities
    • 214
    • Highly Influential
    A unified scheme for generalizing cardinality estimators to sum aggregation
    • 5
    • PDF
    A linear-time probabilistic counting algorithm for database applications
    • 358
    • Highly Influential
    • PDF
    Probabilistic Counting Algorithms for Data Base Applications
    • 1,195
    • Highly Influential
    • PDF
    An optimal algorithm for the distinct elements problem
    • 273
    • PDF
    On synopses for distinct-value estimation under multiset operations
    • 146
    • PDF
    Bitmap Algorithms for Counting Active Flows on High-Speed Links
    • 309
    • PDF