Anita Alicante

The aim of this paper is to contribute to the debate on the issues raised by Morphologically Rich Languages and, more precisely, to investigate, from a cross-paradigm perspective, the influence of constituent order on the data-driven parsing of one such language (i.e. Italian). It therefore presents new evidence from experiments on Italian, a language…
This paper proposes and discusses the use of text mining techniques for the extraction of information from clinical records written in Italian. However, as it is very difficult and expensive to obtain annotated material for languages other than English, we only consider unsupervised approaches, where no annotated training set is necessary. We therefore…
Identifier names play a key role in program understanding and in particular in concept location. Programmers can easily “parse” identifiers and understand the intended meaning. This, however, is not trivial for tools that try to exploit the information in the identifiers to support program understanding. To address this problem, we resort to natural…
In this work we consider the problem of extracting concepts and the relations between them from documents, with the aim of constructing an index for a more semantically oriented search engine. While assessment is performed on a biomedical application, the proposed solutions can also be applied to different domains. With the distributed architecture proposed, we…
This paper discusses the application of an unsupervised text mining technique for the extraction of information from clinical records in Italian. The approach includes two steps. First, a metathesaurus is exploited together with natural language processing tools to extract the domain entities. Then, clustering is applied to explore relations between…
The healthcare domain is characterized by a huge amount of data, contained in medical records, reports, test results and so on. In order to support healthcare workers and manage relevant data in an effective and efficient way, it is important to correctly classify the unstructured parts of text embedded in medical documents. In this paper, we propose…
A crucial point for the applicability of sentiment analysis over Twitter is the degree of manual intervention necessary to adapt the approach to the considered domain. In this work we propose a new sentiment polarity classifier exploiting barrier features, originally introduced for the classification of textual data. Empirical tests…