Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA

Abstract

In the framework of the DIHANA project, we present the acquisition process of a spontaneous speech dialogue corpus in Spanish. The selected application consists of information retrieval by telephone for nationwide trains. A total of 900 dialogues from 225 users were acquired using the Wizard of Oz technique. In this work, we present the design and planning of the dialogue scenes and the wizard strategy used for the acquisition of the corpus. Then, we also present the acquisition tools and a description of the acquisition process.

5 Figures and Tables

Statistics

0102020072008200920102011201220132014201520162017
Citations per Year

76 Citations

Semantic Scholar estimates that this publication has 76 citations based on the available data.

See our FAQ for additional information.

Cite this paper

@inproceedings{Bened2006DesignAA, title={Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA}, author={Jos{\'e}-Miguel Bened{\'i} and Eduardo Lleida and Amparo Varona and Mar{\'i}a Jos{\'e} Castro Bleda and Isabel Galiano and Raquel Justo and I{\~n}igo L{\'o}pez de Letona and Antonio Miguel}, booktitle={LREC}, year={2006} }