Makoto Hiroshige

Speech information processing requires extracting the elements of prosodic features that relate to speakers' intentional control. Speech rate variation should serve as a "caution signal" that strongly calls for listeners' attention. To express and detect such "caution signals", we have proposed a new speech rate model. This model introduces two kinds of force to…
A variable threshold (VT) that detects speech rate deceleration is proposed. The VT varies dynamically depending on the duration of the previous mora in the utterance. The VT should not change rapidly, because listeners cannot perceive small variations in mora duration. Thus, a set of functions with time constants that determine the response speed of the VT…
This paper proposes and evaluates a new direct speech transformation method that operates on waveforms, converting laryngectomee speech to normal speech. Most conventional speech recognition and speech processing systems cannot handle laryngectomee speech with satisfactory results. One of the major causes is the difficulty of preparing corpora. It is very hard to record a…
This paper evaluates a direct speech translation method that operates on waveforms, using the Inductive Learning method, for short conversations. The method can work without conventional speech recognition and speech synthesis because the proposed method does not need syntactic expressions for translation. We focus only on acoustic characteristics of speech…
We aim to detect local deceleration in Japanese spontaneous conversational speech. We have proposed the variable threshold (VT), which detects local speech rate deceleration from the time series of mora durations. In this paper, we add a constant term to the VT so that local deceleration is detected appropriately. The VT is applied to 167 samples of…
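
The abstracts above describe the VT only in outline: a threshold that follows previous mora durations slowly, governed by time constants, with a constant term added. The sketch below is one possible reading of that outline, not the authors' formulation. The first-order smoothing rule, the function name `detect_deceleration`, and the parameters `time_constant` and `offset` are all assumptions introduced for illustration.

```python
def detect_deceleration(mora_durations, time_constant=3.0, offset=0.02):
    """Flag moras that are longer than a slowly adapting threshold.

    Assumed form (not taken from the papers): the threshold is a
    first-order smoothed average of earlier mora durations plus a
    constant term. `time_constant` is measured in moras and `offset`
    in seconds; both are hypothetical parameters.
    """
    if time_constant < 1.0:
        raise ValueError("time_constant must be at least 1 mora")
    if not mora_durations:
        return []

    alpha = 1.0 / time_constant      # per-mora smoothing weight
    smoothed = mora_durations[0]     # start from the first mora
    flags = [False]                  # the first mora has no history

    for duration in mora_durations[1:]:
        # The threshold depends only on moras before the current one.
        threshold = smoothed + offset
        flags.append(duration > threshold)
        # A larger time constant makes the threshold respond more slowly.
        smoothed += alpha * (duration - smoothed)
    return flags


# Hypothetical durations in seconds; the fourth mora is lengthened.
durations = [0.10, 0.11, 0.10, 0.18, 0.11]
print(detect_deceleration(durations))
```

With these illustrative values, only the lengthened fourth mora exceeds the threshold. Small fluctuations around 0.10 s stay below it because of the constant term.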