Interoperability of Annotation Schemes: Using the Pepper Framework to Display AWA Documents in the ANNIS Interface

Abstract

Natural language processing applications are frequently integrated to solve complex linguistic problems, but the lack of interoperability between these tools tends to be one of the main issues found in that process. That is often caused by the different linguistic formats used across the applications, which leads to attempts to both establish standard formats to represent linguistic information and to create conversion tools to facilitate this integration. Pepper is an example of the latter, as a framework that helps the conversion between different linguistic annotation formats. In this paper, we describe the use of Pepper to convert a corpus linguistically annotated by the annotation scheme AWA into the relANNIS format, with the ultimate goal of interacting with AWA documents through the ANNIS interface. The experiment converted 40 megabytes of AWA documents, allowed their use on the ANNIS interface, and involved making architectural decisions during the mapping from AWA into relANNIS using Pepper. The main issues faced during this process were due to technical issues mainly caused by the integration of the different systems and projects, namely AWA, Pepper and ANNIS.

5 Figures and Tables

Cite this paper

@inproceedings{Carlotto2016InteroperabilityOA, title={Interoperability of Annotation Schemes: Using the Pepper Framework to Display AWA Documents in the ANNIS Interface}, author={Talvany Carlotto and Zuhaitz Beloki and Xabier Artola and Aitor Soroa}, booktitle={LREC}, year={2016} }