An overview of text-independent speaker recognition: From features to supervectors
- T. Kinnunen, Haizhou Li
- Computer Science
- Speech Communication
- 2010
Text-dependent speaker verification: Classifiers, databases and RSR2015
- A. Larcher, Kong-Aik Lee, B. Ma, Haizhou Li
- Computer Science
- Speech Communication
- 1 May 2014
A Joint Source-Channel Model for Machine Transliteration
- Haizhou Li, Min Zhang, Jian Su
- Computer Science
- Annual Meeting of the Association for…
- 21 July 2004
Proposes a new framework that allows direct orthographical mapping between two languages through a joint source-channel model, also called the n-gram transliteration model (TM). The framework greatly reduces system development effort and provides a quantum leap in transliteration accuracy over other state-of-the-art machine learning algorithms.
Spoken Language Recognition: From Fundamentals to Practice
- Haizhou Li, B. Ma, Kong-Aik Lee
- Computer Science
- Proceedings of the IEEE
- 6 February 2013
This paper attempts to provide an introductory tutorial on the fundamentals of the theory and the state-of-the-art solutions of spoken language recognition, from both phonological and computational aspects.
Spoofing and countermeasures for speaker verification: A survey
- Zhizheng Wu, N. Evans, T. Kinnunen, J. Yamagishi, F. Alegre, Haizhou Li
- Computer Science
- Speech Communication
- 1 February 2015
A learning-based approach to direction of arrival estimation in noisy and reverberant environments
- Xiong Xiao, Shengkui Zhao, X. Zhong, Douglas L. Jones, Chng Eng Siong, Haizhou Li
- Computer Science
- IEEE International Conference on Acoustics…
- 19 April 2015
Proposes a learning-based approach to robust direction-of-arrival (DOA) estimation that learns from a large amount of simulated noisy and reverberant microphone array inputs, using a multilayer perceptron neural network to learn the nonlinear mapping from features of those inputs to the DOA.
Language Identification: A Tutorial
- E. Ambikairajah, Haizhou Li, Liang Wang, Bo Yin, V. Sethu
- Computer Science, Linguistics
- IEEE Circuits and Systems Magazine
- 9 June 2011
This tutorial presents an overview of the progression of spoken language identification (LID) systems and of current developments, and evaluates LID systems using NIST language recognition evaluation tasks.
A Vector Space Modeling Approach to Spoken Language Identification
- Haizhou Li, B. Ma, Chin-Hui Lee
- Computer Science
- IEEE Transactions on Audio, Speech, and Language…
- 2007
The proposed vector space modeling (VSM) approach leads to a discriminative classifier backend, which is shown to outperform a likelihood-based n-gram language modeling (LM) backend on long utterances.
Low-Variance Multitaper MFCC Features: A Case Study in Robust Speaker Verification
- T. Kinnunen, R. Saeidi, Haizhou Li
- Computer Science
- IEEE Transactions on Audio, Speech, and Language…
- 1 September 2012
This paper provides detailed statistical analysis of MFCC bias and variance using autoregressive process simulations on the TIMIT corpus and proposes the multitaper method for MFCC extraction with a practical focus.
Making Social Robots More Attractive: The Effects of Voice Pitch, Humor and Empathy
- Andreea Niculescu, B. V. Dijk, A. Nijholt, Haizhou Li, See Swee Lan
- Computer Science
- Int. J. Soc. Robotics
- 16 January 2013
The results showed that voice pitch seemed to strongly influence how users rated the overall interaction quality, as well as the robot’s appeal and overall enjoyment.