Representation and Treatment of Multiword Expressions in Basque
The representation of Basque Multiword Lexical Units and the automatic processing of Multiword Expressions are described. HABIL, a tool for the automatic processing of these expressions, is presented, and some evaluation results are given.
EDBL: a General Lexical Basis for the Automatic Processing of Basque
The paper presents the conceptual schema and the main features of the database, along with some problems encountered in its design and implementation in a commercial DBMS.
Big data for Natural Language Processing: A streaming approach
Automatic morphological analysis of Basque
The components of a robust, wide-coverage morphological analyser for Basque, based on the two-level formalism, are described, along with improvements to both the performance of the system's components and the description itself.
Morphosyntactic Disambiguation For Basque Based On The Constraint Grammar Formalism
- I. Aduriz, J. M. Arriola, X. Artola, Arantza Díaz de Ilarraza Sánchez, Koldo Gojenola, M. Maritxalar
This paper presents the development of a surface-based morphosyntactic parsing grammar based on the Constraint Grammar formalism, the first step in the computational treatment of Basque syntax, as well as the results obtained.
EUSLEM: A Lemmatiser/Tagger for Basque
The lemmatiser/tagger is conceived as a basic tool for other linguistic applications and uses the lexical database and the morphological analyser previously developed and implemented.
Lexical Knowledge Representation in an Intelligent Dictionary Help System
- Eneko Agirre, Xabier Arregi, X. Artola, A. D. Ilarraza, K. Sarasola
- Computer Science, COLING
- 5 August 1994
The Intelligent Dictionary System provides various ways of accessing the data, allows implicit knowledge to be deduced from the explicit dictionary information, and uses reasoning mechanisms analogous to those humans use when they consult a dictionary.
Two Architectures for Parallel Processing of Huge Amounts of Text
Two alternative NLP architectures for analyzing massive amounts of documents using parallel processing are presented, and the overall gain when they are used for batch as well as streaming processing is reported.
XUXEN: A Spelling Checker/Corrector for Basque Based on Two-Level Morphology
An extension to continuation class specifications is proposed to deal with long-distance dependencies. It consists basically of two features added to the standard formalism, which allow the lexicon builder to make explicit the interdependencies of morphemes.
A spelling corrector for Basque based on morphology
The Xuxen spelling checker/corrector performs morphological decomposition to detect misspellings and, to correct them, uses a new strategy that draws on an additional two-level morphological subsystem for orthographic errors.