This paper describes LVCSR research into the automatic transcription of spontaneous Czech speech in the MALACH (Multilingual Access to Large Spoken Archives) project. This project attempts to provide improved access to the large multilingual spoken archives collected by the Survivors of the Shoah Visual History Foundation (VHF) (www.vhf.org) by advancing… (More)
In this paper, three speaker verification procedures are tested. All the procedures are based on Gaussian mixture models (GMM), however, they differ in the way, in which they use particular feature vectors of an utterance for speaker verification. A lot of experiments have been performed in a group of 329 speakers. The results showed that there is a… (More)
Presented paper takes interest in a speaker identification problem. The attributes representing voice of a particular speaker are obtained from very short segments of the speech waveform corresponding only to one pitch period of vowels. The patterns formed from the samples of a pitch period waveform are either matched in time domain by use of a nonlinear… (More)
The article introduces an expert system for the speaker verification task. Our main purpose was to design a tool for the combination of various speaker verification systems proposed for various operating conditions. First of all, the essential ideas are explained that made us design the expert system. Next section describes the structure of a rule-based… (More)
The paper present the issues encountered in processing spontaneous Czech speech in the MALACH project. Specific problems connected with a frequent occurrence of colloquial words in spontaneous Czech are analyzed; a partial solution is proposed and experimentally evaluated.
In this paper we describe the initial stages of the ASR component of the MALACH (Multilingual Access to Large Spoken Archives) project. This project will attempt to provide improved access to the large multilingual spoken archives collected by the Survivors of the Shoah Visual History Foundation (VHF) by advancing the state of the art in automated speech… (More)
In this paper methods are proposed which can be used to select a set of phonetically balanced sentences. The principle of the methods is presented and some experimental results are given. In the end of the paper the use of the proposed methods for the Czech read-speech corpus design is described in detail and the structure of the corpus is explained.
The paper deals with the problem of automatic speaker change detection. A metric-based algorithm, called MDISTBIC, which means Modified DISTBIC, is proposed in this paper. The algorithm originates from the DISTBIC algorithm and modifies it in order to reach a higher efficiency. Both the DISTBIC and the MDISTBIC methods are tested in a number of experiments.… (More)