A maximum entropy approach for integrating semantic information in statistical language models

Abstract

In this paper, we propose an adaptive statistical language model that incorporates semantic information into an n-gram model. Traditional n-gram models exploit only the immediately preceding words of the history. We first introduce the semantic topic as a new information source for capturing long-distance dependencies in language modeling, and then adopt the maximum entropy (ME) approach, instead of the conventional linear interpolation method, to integrate the semantic information with the n-gram model. In the ME approach, each information source gives rise to a set of constraints that the hybrid model must satisfy. In the experiments, ME language models trained on the China Times newswire corpus achieved a 40% perplexity reduction over the baseline bigram model.
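
To make the contrast in the abstract concrete, the following minimal sketch compares linear interpolation with the exponential (log-linear) form that a maximum entropy solution takes under expectation constraints. It is an illustration under stated assumptions, not the authors' implementation: the function names, the feature keys `("bigram", ...)` and `("topic", ...)`, and the toy weights are all hypothetical, and the weights are set by hand rather than trained (in practice they would be estimated, for example by iterative scaling).

```python
import math


def linear_interpolation(p_ngram, p_topic, lam):
    """Mix two distributions over the same vocabulary with weight lam."""
    return {w: lam * p_ngram[w] + (1.0 - lam) * p_topic[w] for w in p_ngram}


def maxent_distribution(vocab, prev_word, topic, weights):
    """Log-linear distribution p(w | prev_word, topic).

    `weights` maps hypothetical feature keys to real-valued parameters:
      ("bigram", prev_word, w)  fires for the word pair (prev_word, w)
      ("topic", topic, w)       fires when w occurs under the given topic
    Features missing from `weights` contribute 0 to the score.
    """
    scores = {}
    for w in vocab:
        s = weights.get(("bigram", prev_word, w), 0.0)
        s += weights.get(("topic", topic, w), 0.0)
        scores[w] = math.exp(s)
    z = sum(scores.values())  # normalizer over the vocabulary
    return {w: v / z for w, v in scores.items()}


# Toy usage with hand-set (untrained) weights.
vocab = ["bank", "river", "loan"]
weights = {("bigram", "the", "bank"): 1.0, ("topic", "finance", "loan"): 0.5}
p_me = maxent_distribution(vocab, "the", "finance", weights)
```

In the interpolated model each source contributes a fixed share of probability mass, whereas in the log-linear form the two sources combine in the exponent and a single normalizer ties them together.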

DOI: 10.1109/CHINSL.2004.1409648

Cite this paper

@article{Chueh2004AME,
  title={A maximum entropy approach for integrating semantic information in statistical language models},
  author={Chuang-Hua Chueh and Jen-Tzung Chien and Hsin-Min Wang},
  journal={2004 International Symposium on Chinese Spoken Language Processing},
  year={2004},
  pages={309-312}
}