A multi-stream audio-video large-vocabulary Mandarin Chinese speech database

Abstract

We present the acquisition and content of a multi-stream audio-visual large-vocabulary database in Mandarin Chinese. The database consists of 17,000 utterances spoken by 225 people and captured by a set of seven cameras and 12 microphones. We also provide the label files that describe the endpoints of the utterances and the script files that represent the… (More)

3 Figures and Tables

Topics

  • Presentations referencing similar topics