# Quantiles over data streams: an experimental study

- Published 2013 in SIGMOD Conference
- DOI: 10.1145/2463676.2465312

A fundamental problem in data management and analysis is to generate descriptions of the distribution of data. It is most common to give such descriptions in terms of the cumulative distribution, which is characterized by the quantiles of the data. The design and engineering of efficient methods to find these quantiles has attracted much study, especially in the case where the data is described incrementally, and we must compute the quantiles in an online, streaming fashion. Yet while such…
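To make the setting concrete, here is a minimal Python sketch of what computing a quantile means, first exactly over stored data and then approximately over a stream. It is not taken from the paper and is not one of the algorithms it evaluates: the streaming part uses plain reservoir sampling as a simple stand-in, and the names `exact_quantile` and `ReservoirQuantiles`, the rank convention, and the `capacity` and `seed` parameters are all assumptions made for illustration.

```python
import math
import random


def exact_quantile(values, phi):
    """Return the phi-quantile of values, taken here (by assumption) to be
    the element of rank ceil(phi * n) in sorted order, for 0 < phi <= 1."""
    if not values:
        raise ValueError("values must be non-empty")
    if not 0 < phi <= 1:
        raise ValueError("phi must be in (0, 1]")
    ordered = sorted(values)
    rank = max(1, math.ceil(phi * len(ordered)))
    return ordered[rank - 1]


class ReservoirQuantiles:
    """Hypothetical streaming estimator: keeps a uniform random sample of
    fixed size (reservoir sampling) and answers quantile queries from it."""

    def __init__(self, capacity, seed=0):
        self.capacity = capacity
        self.sample = []
        self.seen = 0
        self.rng = random.Random(seed)

    def update(self, x):
        """Process one stream item using O(capacity) memory overall."""
        self.seen += 1
        if len(self.sample) < self.capacity:
            self.sample.append(x)
        else:
            # Keep the new item with probability capacity / seen.
            j = self.rng.randrange(self.seen)
            if j < self.capacity:
                self.sample[j] = x

    def query(self, phi):
        """Estimate the phi-quantile of everything seen so far."""
        return exact_quantile(self.sample, phi)


if __name__ == "__main__":
    sketch = ReservoirQuantiles(capacity=100)
    for item in range(1000):
        sketch.update(item)
    print("estimated median:", sketch.query(0.5))
```

The exact version needs all the data in memory, while the streaming version sees each item once and keeps only a bounded sample, trading accuracy for space. That trade-off is the kind of thing an experimental comparison of streaming quantile methods would measure.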

