Learn More
The aims of the SpeechDat-Car project are to develop a set of speech databases to support training and testing of multilingual speech recognition applications in the car environment. As a result, a total of ten (10) equivalent and similar resources will be created. The 10 languages are Danish, each language 600 sessions will be recorded (from at least 300(More)
SpeechRecorder is a platform independent audio recording software for speech corpus recordings. It is implemented in Java in a clean object-oriented design and adheres to established technology standards and document interchange formats. SpeechRecorder allows Unicode text and multimedia prompts, it supports audio recordings via more than two channels, and(More)
The goal of the SpeechDat project is to develop spoken language resources for speech recognisers suited to realise voice driven teleservices. SpeechDat created speech databases for all official languages of the European Union and some major dialectal varieties and minority languages. The size of the databases ranges between 500 and 5000 speakers. In total(More)
The main purpose of this study was to compare acoustically the vowel spaces of two groups of cochlear implantees (CI) with two age-matched normal hearing groups. Five young test persons (15-25 years) and five older test persons (55-70 years) with CI and two control groups of the same age with normal hearing were recorded. The speech material consisted of(More)
We describe the pronunciation model of the automatic segmen-tation technique MAUS based on a data-driven Markov process and a new evaluation measure for phonemic transcripts relative symmetric accuracy; results are given for the MAUS segmenta-tion and labelling on German dialog speech. MAUS is currently distributed as a freeware package by the Bavarian(More)
WWWTranscribe is a transcription system based on the WWW. It is platform independent and allows network access to speech databases. Its modular structure make it flexible, and it connects easily to existing signal processing applications or database management systems. WWWTranscribe consists of static HTML documents containing forms. To these forms CGI(More)
With the globalisation and evolving technology of voice-driven man-machine interfaces there is a growing demand for acquisition of spoken language resources in a number of speaker populations being representative for a number of languages and countries. In this paper experience from work within a large consortium in creating large multilingual speech(More)
In this article experiences in creating large multilingual speech databases for teleservices within a large consortium are reported in order to inspire, to facilitate or to compare the setup and progress of other enterprises for collecting large speech databases. The focus will be on following aspects: Objectives, benefits, and strategy; project(More)