The recognition of mathematics notation by a computer is made difficult by the two-dimensional nature of the parsing problem, as well as by the richness and ambiguity of the notation. Parsing mathematics typeset in TeX constitutes a simplified, idealized 2-D recognition problem, allowing the recognition engine to concentrate more on semantic understanding. Choosing TeX as an input form for mathematics is immediately desirable for document recognition because of the availability of many published works in…

This project was initially driven by the goal of scanning and parsing some 5,000 pages of elaborate mathematics (tables of definite integrals). While the prototype system succeeds in translating noise-free typeset equations into Lisp expressions appropriate for further processing, a more semantic, top-down approach appears necessary for higher levels of performance.