Detecting incorrectly-segmented utterances for posteriori restoration of turn-taking and ASR results

Abstract

Appropriate turn-taking is important in spoken dialogue systems as well as generating correct responses. We have developed a method that performs a posteriori restoration of incorrectly segmented utterances caused by erroneous voice activity detection (VAD), which result in automatic speech recognition (ASR) errors and inappropriate turn-taking. A crucial… (More)

Topics

9 Figures and Tables