Introduction and Overview

The explosion of information technology in the last two decades has led to a substantial growth in quantity, diversity and complexity of web-accessible linguistic data. These resources become even more useful when linked with each other, and the last few years have seen the emergence of numerous approaches in various disciplines concerned with linguistic resources. It is the challenge of our time to store, interlink and exploit this wealth of data accumulated in more than half a century of… 

Deconstructing descriptive grammars

This work examines the domain of grammaticography and reconceptualizes a traditional descriptive grammar as a database of linked data, in principle curated from distinct sources. This allows us to benefit from new technology without losing important features intrinsic to the structure of the traditional version of the resource.



XML-based Stand-off Representation and Exploitation of Multi-Level Linguistic Annotation

An XML-based, generic stand-off architecture for multi-level linguistic annotations is proposed, an example instantiation of this architecture is presented, and application scenarios that profit from it are sketched out.
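
To make the idea of stand-off annotation concrete, here is a minimal sketch in Python. It does not reproduce the architecture proposed in the paper: the element names (`layer`, `span`, `tag`), the attributes, and the helper functions are hypothetical and chosen only for illustration. It shows the general principle that the primary text is stored once, while each annotation level lives in its own XML layer and points back to the text by character offsets or to another layer by identifier.

```python
import xml.etree.ElementTree as ET

# Primary data: stored once and never modified by annotation.
PRIMARY_TEXT = "Time flies like an arrow"

# Hypothetical token layer: refers to the primary text by character offsets.
TOKEN_LAYER = """<layer name="tokens">
  <span id="t1" start="0" end="4"/>
  <span id="t2" start="5" end="10"/>
  <span id="t3" start="11" end="15"/>
  <span id="t4" start="16" end="18"/>
  <span id="t5" start="19" end="24"/>
</layer>"""

# Hypothetical part-of-speech layer: refers to tokens by id, not to the text.
POS_LAYER = """<layer name="pos">
  <tag ref="t1" value="NOUN"/>
  <tag ref="t2" value="VERB"/>
  <tag ref="t3" value="ADP"/>
  <tag ref="t4" value="DET"/>
  <tag ref="t5" value="NOUN"/>
</layer>"""


def load_spans(layer_xml):
    """Map each span id to its (start, end) character offsets."""
    root = ET.fromstring(layer_xml)
    return {
        s.get("id"): (int(s.get("start")), int(s.get("end")))
        for s in root.iter("span")
    }


def load_tags(layer_xml):
    """Map each referenced span id to its annotation value."""
    root = ET.fromstring(layer_xml)
    return {t.get("ref"): t.get("value") for t in root.iter("tag")}


def merge_layers(text, token_xml, pos_xml):
    """Resolve both layers against the primary text, in text order."""
    spans = load_spans(token_xml)
    tags = load_tags(pos_xml)
    ordered = sorted(spans.items(), key=lambda item: item[1][0])
    return [(text[start:end], tags.get(tid)) for tid, (start, end) in ordered]


if __name__ == "__main__":
    for token, tag in merge_layers(PRIMARY_TEXT, TOKEN_LAYER, POS_LAYER):
        print(token, tag)
```

Because the layers are independent of each other, a further level (for example syntax or coreference) could be added as another layer without touching the text or the existing annotations.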

Encoding Linguistic Corpora

The CES identifies a minimal encoding level that corpora must achieve to be considered standardized in terms of descriptive representation (marking of structural and linguistic information) and provides encoding conventions for more extensive encoding and for linguistic annotation, as well as a general architecture for representing corpora annotated for linguistic features.

The Semantic Gap of Formalized Meaning

OWL is deployed as a Meaning Representation Language, and a unified model is created that combines existing NLP methods with linguistic knowledge and aggregates disambiguated background knowledge from the Web of Data to improve the efficiency of methods in NLP and ontology learning.

A formal framework for linguistic annotation

SKOS Simple Knowledge Organization System Reference

This document defines the Simple Knowledge Organization System (SKOS), a common data model for sharing and linking knowledge organization systems via the Web, which provides a standard, low-cost migration path for porting existing knowledge organization systems to the Semantic Web.

What Does Interoperability Mean, Anyway? Toward an Operational Definition of Interoperability for Language Technology

The focus is on the results of a recent workshop whose goal was to arrive at operational definitions for interoperability over four thematic areas: metadata for describing language resources, data categories and their semantics, resource publication requirements, and software sharing.

GrAF: A Graph-based Format for Linguistic Annotations

GrAF is an extension of the Linguistic Annotation Framework developed within ISO TC37 SC4 and, as such, implements state-of-the-art best practice guidelines for representing linguistic annotations and allows for the application of well-established graph traversal and analysis algorithms.
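
The benefit of a graph-based representation can be illustrated with a small Python sketch. It is not the GrAF format itself: the dictionary layout, node identifiers, and the `leaves` helper are assumptions made for this example. It only shows that once annotations are nodes and edges, a standard depth-first traversal is enough to answer a typical query such as "which words does this constituent cover?". The sketch assumes the graph is acyclic.

```python
# Hypothetical annotation graph: node id -> (label, ordered list of child ids).
# Inner nodes are syntactic constituents, leaf nodes are words.
GRAPH = {
    "s":  ("S",  ["np", "vp"]),
    "np": ("NP", ["w1", "w2"]),
    "vp": ("VP", ["w3"]),
    "w1": ("the", []),
    "w2": ("dog", []),
    "w3": ("barks", []),
}


def leaves(graph, root):
    """Return the labels of all leaf nodes reachable from root, left to right.

    Uses an iterative depth-first traversal; assumes the graph has no cycles.
    """
    stack = [root]
    found = []
    while stack:
        node = stack.pop()
        label, children = graph[node]
        if not children:
            found.append(label)
        else:
            # Push children in reverse so the leftmost child is visited first.
            stack.extend(reversed(children))
    return found


if __name__ == "__main__":
    print(leaves(GRAPH, "s"))
    print(leaves(GRAPH, "np"))
```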

OWL Web ontology language overview

This document provides an introduction to OWL, the Web Ontology Language, by informally describing the features of each of its sublanguages. OWL provides additional vocabulary along with a formal semantics.

The Georgetown-IBM experiment demonstrated in January 1954

The public demonstration of a Russian-English machine translation system in New York in January 1954 caused a great deal of public interest and much controversy and raised expectations of automatic systems capable of high quality translation in the near future.

Formalising Multi-layer Corpora in OWL DL - Lexicon Modelling, Querying and Consistency Control

We present a general approach to formally modelling corpora with multi-layered annotation, thereby inducing a lexicon model in a typed logical representation language, OWL DL. This model can be