Automatic Detection and Correction for Chinese Misspelled Words Using Phonological and Orthographic Similarities

Abstract

How to detect and correct misspelled words in documents is a very important issue for Mandarin and Japanese. This paper uses phonological similarity and orthographic similarity co-occurrence to train linear regression model. Using ACL-SIGHAN 2013 Bake-off Dataset, experimental results indicate that the detection F-score, error location F-score of our… (More)

Topics

4 Figures and Tables