Determining an author's native language by mining a text for errors

@inproceedings{Koppel2005DeterminingAA,
  title={Determining an author's native language by mining a text for errors},
  author={Moshe Koppel and Jonathan Schler and Kfir Zigdon},
  booktitle={KDD},
  year={2005}
}
In this paper, we show that stylistic text features can be exploited to determine an anonymous author's native language with high accuracy. Specifically, we first use automatic tools to ascertain frequencies of various stylistic idiosyncrasies in a text. These frequencies then serve as features for support vector machines that learn to classify texts according to author native language. 

From This Paper

Figures, tables, and topics from this paper.

Explore Further: Topics Discussed in This Paper

Citations

Publications citing this paper.
SHOWING 1-10 OF 109 CITATIONS, ESTIMATED 28% COVERAGE

384 Citations

020406080'09'12'15'18
Citations per Year
Semantic Scholar estimates that this publication has 384 citations based on the available data.

See our FAQ for additional information.

Similar Papers

Loading similar papers…