Prepositional Phrase Attachment through a Backed-Off Model

Abstract

Recent work has considered corpus-based or statistical approaches to the problem of prepositional phrase attachment ambiguity. Typically, ambiguous verb phrases of the form v rip1 p rip2 are resolved through a model which considers values of the four head words (v, nl, p and 77,2). This paper shows that the problem is analogous to n-gram language models in speech recognition, and that one of the most common methods for language modeling, the backed-off estimate, is applicable. Results on Wall Street Journal data of 84.5% accuracy are obtained using this method. A surprising result is the importance of low-count events-ignoring events which occur less than 5 times in training data reduces performance to 81.6%.

Extracted Key Phrases

Statistics

01020'96'98'00'02'04'06'08'10'12'14'16
Citations per Year

255 Citations

Semantic Scholar estimates that this publication has received between 208 and 318 citations based on the available data.

See our FAQ for additional information.