Speaker identification using pseudo pitch synchronized phase information in noisy environments

Abstract

In conventional speaker identification methods based on mel-frequency cepstral coefficients (MFCCs), phase information is ignored. Recent studies have shown that phase information contains speaker dependent characteristics, and, pitch synchronous phase information is more suitable for speaker identification. In this paper, we verify the effectiveness of pitch synchronous phase information for speaker identification in noisy environments. Experiments were conducted using the JNAS (Japanese Newspaper Article Sentence) database. The pseudo pitch synchronized phase information based method achieved a relative speaker identification error reduction rate of 15.5% compared to the conventional phase information (that is pitch non-synchronized phase). By cutting frames with low power and combining phase information with MFCC, a furthermore improvement was obtained.

DOI: 10.1109/APSIPA.2013.6694385

Extracted Key Phrases

7 Figures and Tables

Cite this paper

@article{Kawakami2013SpeakerIU, title={Speaker identification using pseudo pitch synchronized phase information in noisy environments}, author={Yuta Kawakami and Longbiao Wang and Seiichi Nakagawa}, journal={2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference}, year={2013}, pages={1-4} }