Part-of-speech histograms for genre classification of text

@article{Feldman2009PartofspeechHF,
  title={Part-of-speech histograms for genre classification of text},
  author={Sergey Feldman and Marius A. Marin and Mari Ostendorf and Maya R. Gupta},
  journal={2009 IEEE International Conference on Acoustics, Speech and Signal Processing},
  year={2009},
  pages={4781-4784}
}
This work addresses the problem of classifying the genre of text, which is useful for a variety of language processing problems. We propose statistics of POS histograms as classification features, coupled with a quadratic discriminant classifier. In experiments on six different text and speech genres, we demonstrate enhanced performance compared to standard techniques using word frequency count features and POS trigram features. Experiments on genres that were not seen in training show… CONTINUE READING
Highly Cited
This paper has 27 citations. REVIEW CITATIONS

From This Paper

Figures, tables, and topics from this paper.

References

Publications referenced by this paper.
Showing 1-10 of 19 references

Automatic detection of text genre

B. Kessler, G. Numberg, H. Schütze
ACL-35, 1997, pp. 32–38. • 1997
View 4 Excerpts
Highly Influenced

Filtering web text to match target genres

2009 IEEE International Conference on Acoustics, Speech and Signal Processing • 2009

Rapid language model development using external resources for new spoken dialog domains

Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005. • 2005
View 1 Excerpt

Similar Papers

Loading similar papers…