Author pages are created from data sourced from our academic publisher partnerships and public sources.
Share This Author
MEAN TEACHER WITH DATA AUGMENTATION FOR DCASE 2019 TASK 4 Technical Report
A mean-teacher model with convolutional neural network (CNN) and recurrent neuralnetwork (RNN) together with data augmentation and a median window tuned for each class based on prior knowledge is proposed.
From a Wizard of Oz experiment to a real time speech and gesture multimodal interface
Frame-synchronous stochastic matching based on the Kullback-Leibler information
- L. Delphin-Poulat, C. Mokbel, J. Idier
- Computer ScienceProceedings of the IEEE International Conference…
- 12 May 1998
This work chooses to model speech by hidden Markov models (HMMs) in the cepstrum domain and the mismatch is reduced by a parametric function, and presents a frame synchronous estimation of these parameters.
Online SLU model adaptation with a partial oracle
A supervised approach for updating the SLU models of a deployed SDS which doesn’t need any additional manual transcription or annotation processes and is given by the users calling the SDS.
Gaussian density tree structure in a multi-Gaussian HMM-based speech recognition system
This paper presents a Gaussian density tree structure usage which enables a computational cost reduction without a significant degradation of recognition performances, during a continuous speech…
Frame-synchronous adaptation of cepstrum by linear regression
- L. Delphin-Poulat, C. Mokbel
- Computer ScienceIEEE Workshop on Automatic Speech Recognition and…
- 14 December 1997
Recognition experiments carried out on both PSTN and GSM networks show the efficiency of the proposed method: with a model trained on PSTN recorded digits, the error rate can be reduced with bias subtraction and by 36% with linear regression.
Robust speech recognition techniques evaluation for telephony server based in-car applications
- L. Delphin-Poulat
- PhysicsIEEE International Conference on Acoustics…
- 17 May 2004
The feasibility of designing a speech-recognition based telephony server for in-car applications with an acceptable recognition rate is investigated and the gain of using either a robust sound recording device or noise robust front-end is demonstrated.
Exploiting semantic relations for a spoken language understanding application
This article proposes a new confidence measure estimated for concept hypotheses provided by a semantic language model used in the context of a dialog application based upon the ontology and more precisely, upon the semantic relations between concepts.
Signal bias removal using the multi-path stochastic equalization technique
This work applies the MUlti-path Stochastic Equalization framework to perform bias removal in the cepstral domain in order to increase the robustness of automatic speech recognizers.
About improving recognition of spontaneously uttered French city-names
- D. Jouvet, K. Bartkova, Christophe Raix
- Computer ScienceIEEE International Conference on Acoustics…
- 6 April 2003
This paper deals with the recognition of French city-names over the telephone, which involves a 40,000 city-name vocabulary, ranging from short monosyllabic words to long official compound-names, and several ways of improving speech recognition performance are investigated.