Voting-Based Backchannel Timing Prediction Using Audio-Visual Information

Abstract

Although many spoken dialog systems have recently been developed, users still need to summarize and clearly convey what they want the system to do. In human dialog, however, a speaker often summarizes what to say incrementally, provided that there is a good listener who responds to the speaker's utterances at appropriate times. We consider that generating backchannel…
DOI: 10.1145/2974804.2980501

3 Figures and Tables