Compression of small text files using syllables

Jan Lansky and Michal Zemlicka
Data Compression Conference (DCC'06), p. 458
Summary form only given. We adapted the well-known adaptive Huffman coding and LZW algorithms to use syllables and words instead of characters for text compression. We tested the algorithms on collections of small or medium-sized files. On English documents, syllable-based compression gives the expected results: it outperforms the character-based version and is outperformed by the word-based version of the same algorithm. According to our tests, both syllable- and word-based compression methods are…
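
To illustrate what running LZW over units larger than characters can look like, here is a minimal Python sketch. It is not the authors' implementation: the regex tokenizer (runs of letters versus runs of non-letters) is only a stand-in for a real word or syllable splitter, and the sketch assumes the set of distinct tokens (`alphabet`, a hypothetical parameter) is known to both encoder and decoder in advance, whereas a practical coder would need some way to transmit new tokens.

```python
import re


def tokenize(text):
    # Assumption: alternating runs of letters and non-letters stand in for
    # words; a syllable splitter could be substituted here.
    return re.findall(r"[A-Za-z]+|[^A-Za-z]+", text)


def lzw_encode(tokens, alphabet):
    # The table maps tuples of tokens to integer codes and is seeded with
    # every single token, so each input token must appear in `alphabet`.
    table = {(t,): i for i, t in enumerate(alphabet)}
    codes = []
    phrase = ()
    for tok in tokens:
        candidate = phrase + (tok,)
        if candidate in table:
            phrase = candidate
        else:
            codes.append(table[phrase])
            table[candidate] = len(table)
            phrase = (tok,)
    if phrase:
        codes.append(table[phrase])
    return codes


def lzw_decode(codes, alphabet):
    # Rebuild the same table on the fly, mirroring the encoder.
    table = [(t,) for t in alphabet]
    if not codes:
        return []
    prev = table[codes[0]]
    out = list(prev)
    for code in codes[1:]:
        if code < len(table):
            entry = table[code]
        else:
            # The code refers to the phrase the encoder had just defined.
            entry = prev + (prev[0],)
        out.extend(entry)
        table.append(prev + (entry[0],))
        prev = entry
    return out
```

For example, with `tokens = tokenize(text)` and `alphabet = sorted(set(tokens))`, joining the result of `lzw_decode(lzw_encode(tokens, alphabet), alphabet)` gives back `text`. On small inputs the table has little time to grow, which is one reason the choice of unit (character, syllable, or word) matters for short files.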


