José A. Vallejo

Learn More
Accurate prediction of segmental duration from text in a text-to-speech system is difficult for several reasons. One specially relevant is the great quantity of contextual factors that affect timing and how to model them. There are many parameters that affect duration, but not all of them are always relevant. We present a complete environment in which to(More)
In this paper, we describe the development of a female voice in a Restricted-Domain Speech Synthesis System for Spanish. For the design of the database, we have used a greedy-algorithm approach that focus not only on covering a set of target phonemes, but also on mimicking the histogram of prosodic features from a larger database. For modeling the prosody,(More)
Accurate prediction of segmental duration from text in a text-to-speech system is difficult for several reasons. One which is especially relevant is the great quantity of contextual factors that affect timing and it is difficult to find the right way to model them. There are many parameters that affect duration, but not all of them are always relevant and(More)
We present a multi-speaker formant synthesizer based on parameter concatenation. The user can choose among three speakers, two males and one female. The synthesizer stores all the parameters for the basic speaker and linear transformation functions to synthesized the other two. The complete database for one speaker consists of 455 parameterized units(More)
Several methods to improve multiple distant microphone (MDM) speaker diarization based on Time Delay of Arrival (TDOA) features are evaluated in this paper. All of them avoid the use of a single reference channel to calculate the TDOA values and, based on different criteria, select among all possible pairs of microphones a set of pairs that will be used to(More)
  • 1