Automatic classification of speaker characteristics
@article{Nguyen2010AutomaticCO, title={Automatic classification of speaker characteristics}, author={Phuoc Nguyen and Dat T. Tran and Xu Huang and Dharmendra Sharma}, journal={International Conference on Communications and Electronics 2010}, year={2010}, pages={147-152} }
An automatic voice-based classification system of speaker characteristics including age, gender and accent is presented in this paper. Speakers are grouped according to their characteristics and their speech features are then extracted to train speaker group models using different classification techniques. Finally fusion of classification results for those speaker groups is performed to obtain results for each speaker characteristic. The ANDOSL Australian speech database consisting of 108…
7 Citations
A Survey Paper on Gender Identification System using Speech Signal
- Computer Science
- 2017
This paper provides a survey of automatic human gender identification using speech signal characteristics and classifiers and highlights of selection of speech features, their processing and different classifiers used for this purpose are discussed.
A Survey of Speaker Recognition: Fundamental Theories, Recognition Methods and Opportunities
- Computer ScienceIEEE Access
- 2021
This literature survey gives a concise introduction to ASR and provides an overview of the general architectures dealing with speaker recognition technologies, and upholds the past, present, and future research trends in this area.
Computational Assessment of Interest in Speech—Facing the Real-Life Challenge
- Computer ScienceKI - Künstliche Intelligenz
- 2011
A fully automatic combination of brute-forced acoustic features, linguistic analysis, and non-linguistic vocalizations, exploiting cross-entity information in an early feature fusion is introduced.
Semantic Speech Tagging: Towards Combined Analysis of Speaker Traits
- Linguistics, Computer ScienceSemantic Audio
- 2011
This paper deals with the question how further paralinguistic information, such as speaker age, height, or race can provide beneficial information when their ground truth knowledge is provided within single-task speaker classification.
Study of Word-Level Accent Classication and Gender Factors
- Computer Science
- 2013
This work proposes to use stacked ensemble classier to classify gender rstly and then classify accent to improve accuracy, and results show that HMM-MFCC models show promising performance.
Acoustic correlates for perceived effort levels in male and female acted voices.
- PhysicsThe Journal of the Acoustical Society of America
- 2017
Perception-grounded male and female acoustic feature sets which tracked the actors' expressive effort levels through the continuum of whispered, breathy, modal, and resonant speech are presented and validated via multiple models.
Age and Gender Recognition for Speech Applications based on Support Vector Machines
- Computer Science
- 2014
References
SHOWING 1-10 OF 32 REFERENCES
Automatic estimation of one's age with his/her speech based upon acoustic modeling techniques of speakers
- Physics2002 IEEE International Conference on Acoustics, Speech, and Signal Processing
- 2002
A technique which automatically estimates speakers' age only with acoustic, not linguistic, information of their utterances is proposed, showing high correlation between speakers'Age estimated subjectively by humans and automatically calculated score of ‘agedness’.
Comparison of Four Approaches to Age and Gender Recognition for Telephone Applications
- Computer Science2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07
- 2007
A comparative study of four different approaches to automatic age and gender classification using seven classes on a telephony speech task and also compares the results with human performance on the same data.
Voice signatures
- Computer Science2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721)
- 2003
Two approaches for extracting speaker traits are investigated: the first focuses on general acoustic and prosodic features, the second on the choice of words used by the speaker, showing that voice signatures are of practical interest in real-world applications.
Higher-Level Features in Speaker Recognition
- Computer ScienceSpeaker Classification
- 2007
This article briefly summarizes approaches to using higher-level features for text-independent speaker verification over the last decade in terms of their type, temporal span, and reliance on automatic speech recognition for both feature extraction and feature conditioning.
Robust text-independent speaker identification using Gaussian mixture speaker models
- Computer ScienceIEEE Trans. Speech Audio Process.
- 1995
The individual Gaussian components of a GMM are shown to represent some general speaker-dependent spectral shapes that are effective for modeling speaker identity and is shown to outperform the other speaker modeling techniques on an identical 16 speaker telephone speech task.
Speaker Characteristics
- LinguisticsSpeaker Classification
- 2007
In this chapter, we give a brief introduction to speech-driven applications in order to motivate whyit is desirable to automatically recognize particular speaker characteristics from speech. Starting…
Performance of Speaker-independent Speech Recognisers for Automatic Recognition of Australian English
- Linguistics
- 2006
This paper investigates the performance of three speaker-independent speech recognisers (SISRs) that support continuous speech and are currently available for speaker-independent recognition of…
Fusing high- and low-level features for speaker recognition
- Computer ScienceINTERSPEECH
- 2003
It is shown how novel features and classifiers provide complementary information and can be fused together to drive down the equal error rate on the 2001 NIST Extended Data Task to 0.2%—a 71% relative reduction in error over the previous state of the art.
Acoustic Analysis of Adult Speaker Age
- PhysicsSpeaker Classification
- 2007
This chapter offers an introduction to the phonetic study of speaker age, with focus on what is known about the acoustic features which vary with age.
Automatic accent classification of foreign accented Australian English speech
- LinguisticsProceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96
- 1996
An automatic classification system for foreign accents in Australian English (AuE) speech based on accent-dependent parallel phoneme recognition (PPR) has been developed and is novel in that it does not require manually labelled accented data to be trained.