Automatic Categorization of Author Gender via N-Gram Analysis

  title={Automatic Categorization of Author Gender via N-Gram Analysis},
  author={Jonathan Doyle and Vlado Keselj},
We present a method for automatic categorization of author gender via n-gram analysis. Using a corpus of British student essays, experiments using character-level, wordlevel, and part-of-speech n-grams are performed. The peak accuracy for all methods is roughly equal, reaching a maximum of 81%. These results are on par with other, established techniques, while retaining the simplicity and ease-of-generalization inherent in n-gram techniques. 

From This Paper

Figures, tables, and topics from this paper.


Publications referenced by this paper.
Showing 1-10 of 18 references

Similar Papers

Loading similar papers…