Ignasi Iriondo

Learn More
This paper describes the methodology used for validating the results obtained in a study about acoustical modelling of emotional expression in Castilian Spanish. We have obtained a set of rules that describes the behaviour of the most significant parameters of speech related with the emotional expression. The validation of the results of the study has been(More)
The aim of this article is to classify children’s affective states in a real-life non-prototypical emotion recognition scenario. The framework is the same as that proposed in the Interspeech 2009 Emotion Challenge. We used a large set of acoustic features and five linguistic parameters based on the concept of emotional salience. Features were extracted from(More)
In this work, the capability of voice quality parameters to discriminate among different expressive speech styles is analyzed. To that effect, the data distribution of these parameters, directly measured from the acoustic speech signal, is used to train a Linear Discriminant Analysis that conducts an automatic classification. As a result, the most relevant(More)
This paper presents a new approach for designing a concatenative text-to-speech (TTS) system based on multi-domain unit selection. The method achieves good synthetic quality with reasonable computational cost for a general-purpose TTS system. The architecture of the multi-domain database and the text classification algorithm for domain assignment are the(More)
 This paper proposes an approach to transform speech from a neutral style into other expressive styles using both prosody and voice quality (VoQ). The main aim is to validate the usefulness of VoQ in the enhancement of expressive synthetic speech. A Harmonic plus Noise Model (HNM) is used to modify speech following a set of rules extracted from an(More)
Hidden Markov Models based text-to-speech (HMM-TTS) synthesis is a technique for generating speech from trained statistical models where spectrum, pitch and durations of basic speech units are modelled altogether. The aim of this work is to describe a Spanish HMM-TTS system using CBR as a F0 esti-mator, analysing its performance objectively and(More)
El material de voz usado para realizar los experimentos sobre los nuevos procedimientos de medida del jitter y del shimmer, es el mismo que utiliza el CTH desarrollado por el GPMM [12], se trata de cinco corpus de habla expresiva (o emocionada): neutro, agresivo, alegre, sensual y triste; en español y grabados por una locutora profesional. En [6] se(More)
This paper presents the validation of the expres-siveness of an acted corpus produced to be used in speech synthesis, as this kind of emotional speech can be rather lacking in authenticity. The goal is to obtain a system which is able to prune bad utterances from an expressiveness point of view. The results from a previous subjective test are used for the(More)