Improved Named Entity Tagset for Punjabi Language

  title={Improved Named Entity Tagset for Punjabi Language},
  author={Amandeep Kaur and Gurpreet Singh Josan},
  journal={2014 Recent Advances in Engineering and Computational Sciences (RAECS)},
Annotated corpus plays an important role in developing machine learning based Named Entity Recognition system. For creating an annotated corpus, it is important to decide in advance the Named Entity Tagset to be used. A Named Entity Tagset is defined as a collection of tags or labels, in the form of a scheme, indicating the named entity class of a word to which it belongs in the text. In this paper we have proposed an improved Named Entity Tagset of 14 tags for the task of Named Entity… CONTINUE READING