Approximate Frequency Counts over Data Streams

@article{Manku2002ApproximateFC,
  title={Approximate Frequency Counts over Data Streams},
  author={Gurmeet Singh Manku and Rajeev Motwani},
  journal={PVLDB},
  year={2002},
  volume={5},
  pages={1699}
}
We present algorithms for computing frequency counts exceeding a user-specified threshold over data streams. Our algorithms are simple and have provably small memory footprints. Although the output is approximate, the error is guaranteed not to exceed a user-specified parameter. Our algorithms can easily be deployed for streams of singleton items like those found in IP network monitoring. We can also handle streams of variable-sized sets of items exemplified by a sequence of market basket…
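The abstract does not spell out the algorithms, so the following is only a minimal sketch of one bucket-based counting scheme of the kind it describes, in the style of the lossy counting technique associated with this paper. The function names (`lossy_count`, `frequent_items`), the parameter names (`epsilon` for the error bound, `support` for the threshold), and the sample stream are assumptions made for illustration, not the authors' code.

```python
import math


def lossy_count(stream, epsilon):
    """One-pass approximate counting sketch (names are hypothetical).

    Each stored count undercounts the true frequency by at most
    epsilon * n, where n is the number of items seen so far.
    """
    width = math.ceil(1 / epsilon)   # bucket width
    counts = {}                      # item -> [count, max_undercount]
    n = 0
    for item in stream:
        n += 1
        bucket = math.ceil(n / width)            # current bucket id
        if item in counts:
            counts[item][0] += 1
        else:
            # The item may have been seen and pruned in earlier buckets.
            counts[item] = [1, bucket - 1]
        if n % width == 0:
            # Bucket boundary: drop entries that cannot be frequent.
            stale = [k for k, (f, d) in counts.items() if f + d <= bucket]
            for key in stale:
                del counts[key]
    return counts, n


def frequent_items(counts, n, support, epsilon):
    """Report items whose stored count is at least (support - epsilon) * n."""
    return {k: f for k, (f, d) in counts.items() if f >= (support - epsilon) * n}


# Illustrative stream: "a" is half the items, "b" a quarter, "c" and "d" an eighth each.
stream = ["a", "b", "a", "c", "a", "b", "a", "d"] * 4
counts, n = lossy_count(stream, epsilon=0.125)
result = frequent_items(counts, n, support=0.25, epsilon=0.125)
```

In this sketch, memory stays small because rare items are pruned at every bucket boundary, while the reporting rule lowers the threshold by `epsilon` so that no item whose true frequency exceeds `support * n` is missed.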
This paper has 1,589 citations.

Citations per Year (chart; x-axis 2003 to 2017, y-axis 0 to 150)

Semantic Scholar estimates that this publication has 1,589 citations based on the available data.