Ascensión Gallardo-Antolín

Learn More
Hidden Markov Models (HMMs) are, undoubtedly, the most employed core technique for Automatic Speech Recognition (ASR). Nevertheless, we are still far from achieving high-performance ASR systems. Some alternative approaches, most of them based on Artificial Neural Networks (ANNs), were proposed during the late eighties and early nineties. Some of them(More)
The complexity of the current global information infrastructure requires novel means of understanding and exploiting the dynamics of information. One means may be through the concept of an information ecosystem. An information ecosystem is analo gous to a natural ecosystem in which there are flo ws of materials and energy analo gous to information flow(More)
+P VJKU RCRGT YG FGUETKDG VJG UVGRU ECTTKGF QWV HQT FGXGNQRKPI C 4CKNYC[ +PHQTOCVKQP 5GTXKEG HQT 5RCPKUJ 6JKU YQTM KPVTQFWEGU C OGVJQFQNQI[ HQT FGUKIPKPI FKCNQIWG OCPCIGTU KP URQMGP FKCNQIWG U[UVGOU HQT TGUVTKEVGF FQOCKPU +P VJKU OGVJQFQNQI[ UGXGTCN UQWTEGU QH KPHQTOCVKQP CTG EQODKPGF KPVWKVKQP QDUGTXCVKQP CPF UKOWNCVKQP HQT FGHKPKPI UGXGTCN FKCNQIWG(More)
The Internet Protocol (IP) environment poses two relevant sources of distortion to the speech recognition problem: lossy speech coding and packet loss. In this paper, we propose a new front-end for speech recognition over IP networks. Specifically, we suggest extracting the recognition feature vectors directly from the encoded speech (i.e., the bit stream)(More)
The improved theoretical properties of Support Vector Machines with respect to other machine learning alternatives due to their max-margin training paradigm have led us to suggest them as a good technique for robust speech recognition. However, important shortcomings have had to be circumvented, the most important being the normalisation of the time(More)
In this paper, we have extended our previous research on a new approach to ASR in the GSM environment. Instead of recognizing from the decoded speech signal, our system works from the digital speech representation used by the GSM encoder. We have compared the performance of a conventional system and the one we propose on a speaker independent, isolateddigit(More)
This paper addresses the problem of speech recognition in the GSM environment. In this context, new sources of distortion, such as transmission errors or speech coding itself, significantly degrade the performance of speech recognizers. While conventional approaches deal with these types of distortion after decoding speech, we propose to recognize from the(More)
In the context of speech and speaker recognition systems, it is well known that the combination of different feature streams can improve significantly their performance. However, the application of multi-stream (MS) techniques to speaker diarization systems has not been extensively studied. In this paper, we address this issue: we formulate different MS(More)