Designing an Indonesian part of speech tagset and manually tagged Indonesian corpus

@article{Dinakaramani2014DesigningAI,
  title={Designing an Indonesian part of speech tagset and manually tagged Indonesian corpus},
  author={Arawinda Dinakaramani and Fam Rashel and Andry Luthfi and Ruli Manurung},
  journal={2014 International Conference on Asian Language Processing (IALP)},
  year={2014},
  pages={66-69}
}
We describe our work on designing a linguistically principled part of speech (POS) tagset for the Indonesian language. The process involves a detailed study and analysis of existing tagsets and the manual tagging of an Indonesian corpus. The results of this work are an Indonesian POS tagset consisting of 23 tags and an Indonesian corpus of over 250.000 lexical tokens that have been manually tagged using this tagset. 

2 Figures & Tables

Topic

Statistics

051015201620172018
Citations per Year

Citation Velocity: 5

Averaging 5 citations per year over the last 3 years.

Learn more about how we calculate this metric in our FAQ.