The predictability of letters in written english

@article{Schrmann1996ThePO,
  title={The predictability of letters in written english},
  author={Thomas Sch{\"u}rmann and Peter Grassberger},
  journal={ArXiv},
  year={1996},
  volume={abs/0710.4516}
}
  • Thomas Schürmann, Peter Grassberger
  • Published in ArXiv 1996
  • Computer Science, Mathematics, Physics
  • We show that the predictability of letters in written English texts depends strongly on their position in the word. The first letters are usually the least easy to predict. This agrees with the intuitive notion that words are well defined subunits in written languages, with much weaker correlations across these units than within them. It implies that the average entropy of a letter deep inside a word is roughly 4–5 times smaller than the entropy of the first letter. 

    Create an AI-powered research feed to stay up to date with new papers like this posted to ArXiv

    1
    Twitter Mention

    Figures from this paper.

    References

    Publications referenced by this paper.
    SHOWING 1-6 OF 6 REFERENCES

    Prediction and entropy of printed English

    VIEW 4 EXCERPTS
    HIGHLY INFLUENTIAL

    J

    • T. C. Bell
    • G. Cleary and I. H. Witten, ”Text Compression”
    • 1990
    VIEW 4 EXCERPTS
    HIGHLY INFLUENTIAL

    Entropy estimation of symbol sequences.

    J

    • M. J. Weinberger
    • J. Rissanen and M. Feder, ”A universal finite memory source,” IEEE Trans. Inform. Theory 41, 643
    • 1995

    Evaluation of the entropy of a language by an improved prediction method with application to printed Hebrew,

    • L. B. Levitin, Z. Reingold
    • (Tel Aviv Univ.,
    • 1994
    VIEW 3 EXCERPTS

    A convergent gambling estimate of the entropy of English

    VIEW 1 EXCERPT