N ov 2 00 7 Patterns of i . i . d . Sequences and Their Entropy - Part I : General Bounds ∗

Abstract

Tight bounds on the block entropy of patterns of sequences generated by independent and identically distributed (i.i.d.) sources are derived. A pattern of a sequence is a sequence of integer indices with each index representing the order of first occurrence of the respective symbol in the original sequence. Since a pattern is the result of data processing on the original sequence, its entropy cannot be larger. Bounds derived here describe the pattern entropy as function of the original i.i.d. source entropy, the alphabet size, the symbol probabilities, and their arrangement in the probability space. Matching upper and lower bounds derived provide a useful tool for very accurate approximations of pattern block entropies for various distributions, and for assessing the decrease of the pattern entropy from that of the original i.i.d. sequence.

1 Figure or Table

Cite this paper

@inproceedings{Shamir2007NO2, title={N ov 2 00 7 Patterns of i . i . d . Sequences and Their Entropy - Part I : General Bounds ∗}, author={Gil I. Shamir}, year={2007} }