Corpus ID: 1607756

Design and Development of a Named Entity Recognizer for an Agglutinative Language

@inproceedings{Alegria2004DesignAD,
  title={Design and Development of a Named Entity Recognizer for an Agglutinative Language},
  author={I. Alegria and Olatz Arregi and Irene Balza and N. Ezeiza and Izaskun Fern{\'a}ndez and R. Urizar},
  year={2004}
}
This paper presents the conclusions reached from the development of a system for Named Entity recognition in written Basque. The system was designed in four steps: first, the development of a recognizer based on linguistic information represented on finitestate-transducers; second, the generation of semi-automatically annotated corpora from the result of these transducers; third, the achievement of the best possible recognizer by training different ML techniques on these corpora; and finally… Expand

Tables from this paper

Simple or Complex? Assessing the readability of Basque Texts
Using Machine Learning Techniques to Build a Comma Checker for Basque
Document Expansion for Cross-Lingual Passage Retrieval
Elhuyar-IXA: Semantic Relatedness and Cross-lingual Passage Retrieval
An XML Framework for a Basque Question Answering System
...
1
2
...

References

SHOWING 1-10 OF 15 REFERENCES
A Simple Named Entity Extractor using AdaBoost
Overview of MUC-7
Finite State Morphology
Description of the LTG System Used for MUC-7
...
1
2
...