Influence of Word Normalization on Text Classification

@inproceedings{Tomana2007InfluenceOW,
  title={Influence of Word Normalization on Text Classification},
  author={Michal Tomana and Roman Tesara and Karel Jezeka},
  year={2007}
}
  • Michal Tomana, Roman Tesara, Karel Jezeka
  • Published 2007
In this paper we focus our attention on the comparison of various lemmatization and stemming algorithms, which are often used in nature language processing (NLP). Sometimes these two techniques are considered to be identical, but there is an important difference. Lemmatization is generally more utilizable, because it produces the basic word form which is required in many application areas (i.e. cross-language processing and machine translation). However, lemmatization is a difficult task… CONTINUE READING
Highly Cited
This paper has 34 citations. REVIEW CITATIONS

Citations

Publications citing this paper.
Showing 1-10 of 22 extracted citations

References

Publications referenced by this paper.
Showing 1-7 of 7 references

A Comparison of Event Models for Naive Bayes Text Classification

  • K. Nigam
  • AAAI / ICML - 98 Workshop on Learning for Text…
  • 1998

An O(ND) Difference Algorithm and Its Variations

  • Eugene W. Myers
  • Algorithmica Vol
  • 1986
1 Excerpt

Similar Papers

Loading similar papers…