Feature warping for robust speaker verification
We propose a novel feature mapping approach that is robust to channel mismatch, additive noise and, to some extent, non-linear effects attributed to handset transducers. These adverse effects can …
  • 751
  • 32
Texture for script identification
The problem of determining the script and language of a document image has a number of important applications in the field of document analysis, such as indexing and sorting of large collections of …
  • 180
  • 16
A Database for Person Re-Identification in Multi-Camera Surveillance Networks
Person re-identification involves recognising individuals in different locations across a network of cameras and is a challenging task due to a large number of varying factors such as pose (both …
  • 108
  • 15
Automatically Detecting Pain in Video Through Facial Action Units
In a clinical setting, pain is reported either through patient self-report or via an observer. Such measures are problematic as they 1) are subjective and 2) give no specific timing information.
  • 193
  • 14
Gait energy volumes and frontal gait recognition using depth images
Gait energy images (GEIs) and their variants form the basis of many recent appearance-based gait recognition systems. The GEI combines good recognition performance with a simple implementation, though …
  • 106
  • 12
i-vector Based Speaker Recognition on Short Utterances
Robust speaker verification on short utterances remains a key consideration when deploying automatic speaker recognition, as many real-world applications often have access to only limited duration …
  • 214
  • 12
Explicit modelling of session variability for speaker verification
This article describes a general and powerful approach to modelling mismatch in speaker recognition by including an explicit session term in the Gaussian mixture speaker modelling framework. Under …
  • 132
  • 12
Least squares congealing for unsupervised alignment of images
In this paper, we present an approach we refer to as "least squares congealing" which provides a solution to the problem of aligning an ensemble of images in an unsupervised manner. Our …
  • 83
  • 12
The QUT-NOISE-TIMIT corpus for the evaluation of voice activity detection algorithms
The QUT-NOISE-TIMIT corpus consists of 600 hours of noisy speech sequences designed to enable a thorough evaluation of voice activity detection (VAD) algorithms across a wide variety of common …
  • 81
  • 12
Person-independent facial expression detection using Constrained Local Models
In automatic facial expression detection, very accurate registration is desired, which can be achieved via a deformable model approach where a dense mesh of 60–70 points on the face is used, such as …
  • 122
  • 10