- Full text PDF available (6)
Data Set Used
The paper reports on the design and construction of a multi-layered corpus of Italian, annotated at the syntactic and lexico-semantic levels, whose development is supported by dedicated software augmented with an intelligent interface. The issue of evaluating this type of resource is also addressed.
 Dan Klein and Christopher D. Manning. Fast exact inference with a factored model for natural language parsing. Another more practical line of activity includes an error analysis to identify the classes of errors done by the two algorithms, so that strategies to cope with them can be designed. For Collins' parsers this would imply the introduction of… (More)
Corpora annotated at semantic level play a crucial role both in research and in applicative contexts in which systems of natural language processing are studied and developed. In this paper we present the lexico-semantic annotation of an Italian treebank, a first attempt to recover the lack of such resource for Italian. We will describe the annotation… (More)
The availability of semantically tagged corpora is becoming a very important and urgent need for training and evaluation within a large number of applications but also they are the natural application and accompaniment of semantic lexicons of which they constitute both a useful testbed to evaluate their adequacy and a repository of corpus examples for the… (More)
In this paper we discuss how the ItalWordNet semantic database, being built by extending the Italian wordnet developed within the EuroWordNet project, is being exploited for the lexical semantic annotation of a corpus of Italian.
The paper reports on the lexico-semantic annotation level of the Italian Treebank, the rst Italian corpus with a multi-level annotation (morpho-syntactic, syntactic and lexico-semantic). The strategy of annotation and the reference lexical resource are described, and the results achieved too.