Engineering the compression of massive tables: an experimental approach

  title={Engineering the compression of massive tables: an experimental approach},
  author={Adam L. Buchsbaum and Donald F. Caldwell and Kenneth Ward Church and Glenn S. Fowler and S. Muthukrishnan},
We study the problem of compressing massive tables. We devise a novel compression paradigm--training for lossless compression-which assumes that the data exhibit dependencies that can be learned by examining a small amount of training material. We develop an experimental methodology to test the approach. Our result is a system, pz ip, which outperforms gz ip by factors of two in compression size and both compression and uncompression time for various tabular data. P z i p is now in production… CONTINUE READING
Highly Cited
This paper has 70 citations. REVIEW CITATIONS


Publications citing this paper.

70 Citations

Citations per Year
Semantic Scholar estimates that this publication has 70 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-5 of 5 references

Similar Papers

Loading similar papers…