A Common Parts-of-Speech Tagset Framework for Indian Languages

Abstract

We present a universal Parts-of-Speech (POS) tagset framework covering most of the Indian languages (ILs) following the hierarchical and decomposable tagset schema. In spite of significant number of speakers, there is no workable POS tagset and tagger for most ILs, which serve as fundamental building blocks for NLP research. Existing IL POS tagsets are… (More)

2 Figures and Tables

Topics

  • Presentations referencing similar topics