Improving Unsupervised Stemming by using Partial Lemmatization Coupled with Data-based Heuristics for Hindi

@inproceedings{Gupta2012ImprovingUS,
  title={Improving Unsupervised Stemming by using Partial Lemmatization Coupled with Data-based Heuristics for Hindi},
  author={Deepa Gupta and Rahul Kumar Yadav and Nidhi Sajan},
  year={2012}
}
Stemming and Lemmatization are two important natural language processing techniques widely used in Information Retrieval (IR) for query processing and in Machine Translation (MT) for reducing the data sparseness. Both minimize inflectional forms, and sometimes derivationally related forms of a word, to a common base form. Most of the existing stemmer and lemmatization work is based either on some language dependent rules which require the supervision of a language expert, or some probabilistic… CONTINUE READING

Similar Papers

Loading similar papers…