Joseba Abaitua

Learn More
This paper shows how the framework of Rhetorical Structure Theory (RST) for discourse modelling can be expressed through XML annotations and then used to implement a natural language generation (NLG) system for the web. The system applies simplified RST schemes to the elaboration of a master document in XML from which content segments are chosen to suit the(More)
One of the key objectives of web service technology is to construct processes that enable service providers to interconnect with their clients. The industry has developed the UDDI registry to enable the register of services and to make them available to their users. However, the keyword-based search of UDDI frequently return no results because there is no(More)
Context-aware systems allow users to access services and multimedia data according to their current context (location, identity, preferences). Web 2.0 fosters user contribution and presents the web as an application programming platform where third parties can create new applications (mash-ups) mixing the functionality offered by others. We deem that(More)
The notion of context has been widely studied and there are several authors that have proposed different definitions of context. However, context has not been widely studied in the framework of human mobility and the notion of context has been imported directly from other computing fields without specifically addressing the tourism domain requirements. In(More)
Content selection is a key factor of any successful document generation system. This paper shows how a content selection algorithm has been implemented using an efficient combination o f XML/XSL technology and the framework of RST for discourse modeling. The system generates multilingual documents adapted to user profiles in a learning environment for the(More)
Extensively annotated bilingual parallel corpora can be exploited to feed editing tools that integrate the processes of document composition and translation. Here we discuss the architecture of an interactive editing tool that, on top of techniques common to most Translation Memory-based systems, applies the potential of SGML's DTDs to guide the process of(More)
Rich markup can considerably benefit the process of establishing bitext correspondences, that is, the task of providing correct identification and alignment methods for text segments that are translation equivalences of each other in a parallel corpus. We present a sentence alignment algorithm that, by taking advantage of previously annotated texts, obtains(More)
Resumen Se presenta el doble trabajo doctoral en marcha relacionado con la resolución de correferencias: uno es la adaptación de los algoritmos conocidos al español, y otro la aplicación a un corpus experimental de textos breves (mensajería y tuits). Además de ello, se presenta una hipotética alimentación de la resolución de correferencias a recursos(More)
In this paper we describe and compare three approaches for the automatic extraction of medical terms using noun phrases (NPs) previously recognized on medical text corpus in Spanish. In the first approach , as baseline, we extracted all NPs, while for the second and third ones the extraction process is directed to " specific NPs " that are determined on the(More)