• Publications
  • Influence
Big data for Natural Language Processing: A streaming approach
TLDR
The use of Storm is explored in a new approach for scalable distributed language processing across multiple machines and its effectiveness and efficiency when processing documents on a medium and large scale are evaluated. Expand
Representation and Treatment of Multiword Expressions in Basque
TLDR
The representation of Basque Multiword Lexical Units and the automatic processing of Multiword Expressions are described and HABIL, a tool for the automaticprocessing of these expressions is described, and some evaluation results are given. Expand
Automatic morphological analysis of Basque
TLDR
The components of a robust and wide-coverage morphological analyser for Basque, based on the two-level formalism, are described and improved both the performance of the different components of the system and the description itself. Expand
Morphosyntactic Disambiguation For Basque Based On The Constraint Grammar Formalism
TLDR
This paper presents the development of a surface-based morphosyntactic parsing grammar based on the Constraint Grammar formalism, as well as the results obtained, which is the first step in the computational treatment of Basque syntax. Expand
EDBL: a General Lexical Basis for the Automatic Processing of Basque
TLDR
The paper presents the conceptual schema and the main features of the database, along with some problems encountered in its design and implementation in a commercial DBMS. Expand
EUSLEM: A Lemmatiser/Tagger for Basque
TLDR
The lemmatiser/tagger is conceived as a basic tool for other linguistic applications and uses the lexical database and the morphological analyser previously developed and implemented. Expand
Lexical, Knowledge Representation in an Intelligent Dictionary Help System
TLDR
Intelligent Dictionary System provides various access possibilities to the data, allowing to deduce implicit knowledge from the explicit dictionary information, and deals with reasoning mechanisms analogous to those used by humans when they consult a dictionary. Expand
XUXEN: A Spelling Checker/Corrector for Basque Based on Two-Level Morphology
TLDR
An extension for continuation class specifications in order to deal with long-distance dependencies is proposed and consists basically of two features added to the standard formalism which allow the lexicon builder to make explicit the interdependencies of morphemes. Expand
A word-grammar based morphological analyzer for agglutinative languages
TLDR
The work here presented proposes a model for designing a full morphological analyzer that integrates the two-level formalism and a unification-based formalism, and proposes to separate the treatment of sequential and non-sequential morphotactic constraints. Expand
A spelling corrector for Basque based on morphology
TLDR
The Xuxen spelling checker/corrector performs morphological decomposition in order to check misspellings and, to correct them, uses a new strategy which combines the use of an additional two-level morphological subsystem for orthographic errors. Expand
...
1
2
3
4
5
...