Automatic phonetic segmentation
TLDR
The most frequently used approach, based on a modified Hidden Markov Model (HMM) phonetic recognizer, is analyzed; a general framework for the local refinement of boundaries is proposed; and the performance of several pattern classification approaches is compared within this framework.
Emulating DNA: Rigorous Quantification of Evidential Weight in Transparent and Testable Forensic Speaker Recognition
TLDR
It is shown how the evaluation of DNA evidence, which is based on a probabilistic similarity-typicality metric in the form of likelihood ratios (LR), can also be generalized to continuous LR estimation, thus providing a common framework for phonetic-linguistic methods and automatic systems.
Skin Detection - a Short Tutorial
TLDR
Several computer vision approaches have been developed for skin detection; they typically transform a given pixel into an appropriate color space and then use a skin classifier to label the pixel as skin or non-skin.
BiosecurID: a multimodal biometric database
TLDR
A new multimodal biometric database, acquired in the framework of the BiosecurID project, is presented together with a description of the acquisition setup and protocol. Its features include a realistic acquisition scenario, balanced gender and population distributions, availability of information about particular demographic groups, and compatibility with other existing databases.
An end-to-end approach to language identification in short utterances using convolutional neural networks
This work has been supported by project CMC-V2: Caracterizacion, Modelado y Compensacion de Variabilidad en la Senal de Voz (TEC2012-37585-C02-01), funded by Ministerio de Economia y Competitividad,
An analysis of the influence of deep neural network (DNN) topology in bottleneck feature based language recognition
TLDR
Language recognition results are analyzed for different topologies of the DNN used to extract the bottleneck features, comparing them with each other and against a reference system based on a more classical cepstral representation of the input signal with a total variability model, to obtain useful knowledge about how the DNN configuration influences the performance of bottleneck feature-based language recognition systems.
Synthetic Fingerprint Generation
Assessment of Severe Apnoea through Voice Analysis, Automatic Speech, and Speaker Recognition Techniques
TLDR
An acoustic search for distinctive apnoea voice characteristics is described and an 81% correct classification rate is achieved, which is very promising and underpins the interest in this line of inquiry.
Automatic alternative transcription generation and vocabulary selection for flexible word recognizers
TLDR
This work proposes the use of the new transcription confusability measure in two different word error rate (WER) reduction procedures for FVRs: an automatic vocabulary selection procedure suitable for those applications where the set of vocabulary words is not totally defined by the application, and an automatic procedure for generation of alternative transcriptions.