Yan-You Chen

  • Citations Per Year
Learn More
In recent years, emotion-aware human-machine interactions have become an important issue. Most of the traditional researches focused on the use of different features and classification methods to improve emotion recognition rates. However, they still cannot recognize detailed and various emotions. Accordingly, in this paper, an emotion recognition system,(More)
This paper brings together speech recognition, emotion inference and virtual agents to implement a system for student interaction in an educational environment. By analyzing the capture speech, we can perceive an indication of the emotion status of the target student. Using the inference results, an agent can choose the suitable dialogue to interact with(More)
A systematic approach is proposed to synthesizing personalized spontaneous speech using a small-sized unsegmented speech corpus of the target speaker. First, an automatic segmentation algorithm is employed to segment and label a collected semispontaneous speech corpus of the target speaker. Then, a pretrained average voice model is adapted to the voice(More)
This study proposes a hybrid approach to natural-sounding speech synthesis based on candidate expansion, unit selection, and prosody adjustment using a small corpus. The proposed method is more specific to tonal language, in particular Mandarin. In conventional speech synthesis studies, the quality of synthesized speech depends heavily on the size of the(More)
As the growth of economy and technology has become increasingly rapid, mental care is getting more important today. However, recent movements, such green technologies, place more emphasis on environmental issues but less on mental care. Therefore, this paper presents a newly emerging technology called orange computing for mental care applications. Orange(More)
In control vector-based expressive speech synthesis, the emotion/style control vector defined in the categorical (CAT) emotion space is uneasy to be precisely defined by the user to synthesize the speech with the desired emotion/style. This paper applies the arousal-valence (AV) space to the multiple regression hidden semi-Markov model (MRHSMM)-based(More)
  • 1