Data Set Used
In this paper we present a novel approach to multi–word terminology extraction combining a well–known automatic term recognition approach, the C–NC value method, with a contrastive ranking technique, aimed at refining obtained results either by filtering noise due to common words or by discerning between semantically different types of terms within… (More)
The entities mentioned in collections of scholarly articles in the Humanities (and in other scholarly domains) belong to different types from those familiar from news corpora, hence new resources need to be annotated to create supervised taggers for tasks such as ne extraction. However, in such domains there is a great need for making the best use possible… (More)
Modelling, analysis and synthesis of behaviour are the subject of major efforts in computing science, especially when it comes to technologies that make sense of human–human and human–machine interactions. This article outlines some of the most important issues that still need to be addressed to ensure substantial progress in the field, namely (1)… (More)
This study explores laughter distribution around topic changes in multiparty conversations. The distribution of shared and solo laughter around topic changes was examined in corpora containing two types of spoken interaction ; meetings and informal conversation. Shared laughter was significantly more frequent in the 15 seconds leading up to topic change in… (More)
Relational database (DB) management systems provide the standard means for structuring and querying large amounts of data. However , to access such data the exact structure of the DB must be know, and such a structure might be far from the conceptualization of a human being of the stored information. Ontologies help to bridge this gap, by providing a high… (More)
In this paper we tackle the challenging task of Multi-word term (MWT) extraction from different types of specialized corpora. Contrastive filtering of previously extracted MWTs results in a considerable increment of acquired domain-specific terms.