Developing Competitive HMM PoS Taggers Using Small Training Corpora

Abstract

This paper presents a study aiming to find out the best strategy to develop a fast and accurate HMM tagger when only a limited amount of training material is available. This is a crucial factor when dealing with languages for which small annotated material is not easily available. First, we develop some experiments in English, using WSJ corpus as a test… (More)
DOI: 10.1007/978-3-540-30228-5_12

6 Figures and Tables

Topics

  • Presentations referencing similar topics