Luis J. Rodríguez

Learn More
The Mel-Frequency Cepstral Coefficients (MFCC) and their derivatives are commonly used as acoustic features for speaker recognition. The issue arises of whether some of those features are redundant or dependent on other features. Probably, not all of them are equally relevant for speaker recognition. Reduced feature sets allow more robust estimates of the(More)
Previous works in English have revealed that disfluencies follow regular patterns and that incorporating them into the language model of a speech recognizer leads to lower perplexities and sometimes to a better performance. Although work on disfluency modeling has been applied outside the English community (e.g., in Japanese), as far as we know there is no(More)
This paper presents a new methodology, based on the classical decision tree classification scheme proposed by Bahl [1], to get a suitable set of context dependent sublexical units in Spanish continuous speech recognition tasks. The original method was applied as a first baseline approach. Then two new features were added: a discriminative function to(More)
This paper briefly describes the language recognition system developed by the Sofware Technology Working Group (http://gtts.ehu.es) at the University of the Basque Country in collaboration with IKERLAN Technological Research Center, and submitted to the NIST 2009 Language Recognition Evaluation. The system consists of a hierarchical fusion of individual(More)
This paper presents a new system for the continuous speech recognition of Spanish, integrating previous works in the fields of acoustic-phonetic decoding and language modelling. The system includes decision tree-based sublexical units and syntactic language models based on regular grammars. Acoustic and language models -separately trained with speech and(More)
BACKGROUND AND OBJECTIVE Diabetic retinopathy is a microvascular complication of diabetes mellitus whose prevalence is closely related to the presence of nephropathy and hypertension. The aim was to study clinical and pharmacological factors that are associated with an increased need for laser photocoagulation in patients with diabetic nephropathy and(More)
Finding audio and video resources in internet is becoming an increasingly demanded application. However, search engines are usually limited to adjacent texts (hand supplied transcripts or close captions) to index and classify multimedia documents. Clearly, a key advantage can be taken from using automatic speech recognition and natural language processing(More)
The integration into social and work environments of people with disabilities is a fact nowadays. Tutoring systems are intended for helping this community in their life. These tools are very helpful; although at the moment don't completely meet their needs. This work presents a robust and intelligent tutor system that will cover several new aspects, coping(More)
This paper presents a new system for the continuous speech recognition of Spanish, integrating previous works in the fields of acoustic-phonetic decoding and language modelling. Acoustic and language models -separately trained with speech and text samples, respectivelyare integrated into one single automaton, and their probabilities combined according to a(More)
  • 1