We address the problem of categorising documents using kernel-based methods such as Support Vector Machines. Since the work of Joachims (1998), there has been ample experimental evidence that SVMs using standard word frequencies as features yield state-of-the-art performance on a number of benchmark problems. Recently, Lodhi et al. (2002) proposed the use of ...
This work presents a new perspective on characterizing the similarity between elements of a database or, more generally, nodes of a weighted, undirected graph. It is based on a Markov-chain model of random walk through the database. More precisely, we compute quantities (the average commute time, the pseudoinverse of the Laplacian matrix of the graph, ...
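As an illustration of the quantities named in this abstract, the sketch below computes average commute times from the Moore-Penrose pseudoinverse of the graph Laplacian, using the standard identity n(i, j) = V_G (l+_ii + l+_jj - 2 l+_ij), where V_G is the sum of all node degrees. The function name `average_commute_times` and the toy adjacency matrix are assumptions made for this example, and the graph is assumed to be connected; the truncated abstract does not confirm that the authors compute it this way.

```python
import numpy as np

def average_commute_times(adjacency):
    """Average commute times between all node pairs of a connected,
    weighted, undirected graph (hypothetical helper for illustration)."""
    A = np.asarray(adjacency, dtype=float)
    degrees = A.sum(axis=1)
    laplacian = np.diag(degrees) - A          # L = D - A
    l_plus = np.linalg.pinv(laplacian)        # Moore-Penrose pseudoinverse
    volume = degrees.sum()                    # V_G: sum of all degrees
    d = np.diag(l_plus)
    # n(i, j) = V_G * (l+_ii + l+_jj - 2 * l+_ij)
    return volume * (d[:, None] + d[None, :] - 2.0 * l_plus)

# Toy example: a path graph 0 - 1 - 2 with unit edge weights.
path_graph = [[0, 1, 0],
              [1, 0, 1],
              [0, 1, 0]]
commute = average_commute_times(path_graph)
```

On this path graph, nodes joined by a direct edge have a smaller commute time than the two end nodes, which is the sense in which commute time acts as a dissimilarity.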
We present a geometric view on bilingual lexicon extraction from comparable corpora, which allows us to reinterpret the methods proposed so far and identify unresolved problems. This motivates three new methods that aim at solving these problems. Empirical evaluation shows the strengths and weaknesses of these methods, as well as a significant gain in the ...
This work presents a new perspective on characterizing the similarity between elements of a database or, more generally, nodes of a weighted, undirected graph. It is based on a Markov-chain model of random walk through the database. More precisely, we compute quantities (the average commute time, the pseudoinverse of the Laplacian matrix of the graph, ...
This paper presents a new technique for the calibration of robots based on a maximum likelihood approach for the identification of geometrical errors. A new experimental setup is presented for measurement of the end-effector position errors. The errors of position and orientation of the measuring device are included in the algorithm and identified. Tests ...
OBJECTIVES: This article presents experiments on multi-language information extraction and access in the medical domain. For such applications, multilingual terminology plays a crucial role when working on specialized languages and specific domains. MATERIAL AND METHODS: We first propose a method for enriching multilingual thesauri which extracts new ...
This document describes XRCE's participation in ImagEval, more specifically in the mixed Text-Image search. After reviewing state-of-the-art methods to exploit the correlations between texts and images in multimedia retrieval, we examine the single-media search components and describe how we combined them in the framework of ImagEval. It appeared ...
This work presents some general procedures for computing dissimilarities between elements of a database or, more generally, nodes of a weighted, undirected graph. They are based on a Markov-chain model of random walk through the database. The model assigns transition probabilities to the links between elements, so that a random walker can jump from element ...
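To make the random-walk model concrete, the sketch below builds a transition matrix from a weighted adjacency matrix by normalising each row, so that the probability of jumping from element i to element j is a_ij divided by the total weight of the links leaving i. This is one common choice and an assumption here, since the truncated abstract does not state the exact formula; the name `transition_matrix` and the toy weights are hypothetical.

```python
import numpy as np

def transition_matrix(adjacency):
    """Row-normalise a weighted adjacency matrix into random-walk
    transition probabilities (assumed form: p_ij = a_ij / sum_k a_ik)."""
    A = np.asarray(adjacency, dtype=float)
    row_sums = A.sum(axis=1, keepdims=True)
    if np.any(row_sums == 0):
        # An isolated node has no outgoing links, so no distribution exists.
        raise ValueError("every node needs at least one weighted link")
    return A / row_sums

# Toy example: three elements, with a stronger link between 0 and 1.
weights = [[0, 3, 1],
           [3, 0, 0],
           [1, 0, 0]]
P = transition_matrix(weights)
```

Each row of `P` is a probability distribution over the next element visited, so heavier links are followed more often by the walker.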