Optimization of data-driven filterbank for automatic speaker verification

@article{Sarangi2020OptimizationOD,
  title={Optimization of data-driven filterbank for automatic speaker verification},
  author={Susanta Sarangi and M. Sahidullah and G. Saha},
  journal={Digit. Signal Process.},
  year={2020},
  volume={104},
  pages={102795}
}
  • Susanta Sarangi, M. Sahidullah, G. Saha
  • Published 2020
  • Computer Science, Engineering
  • Digit. Signal Process.
  • Abstract Most of the speech processing applications use triangular filters spaced in mel-scale for feature extraction. In this paper, we propose a new data-driven filter design method which optimizes filter parameters from a given speech data. First, we introduce a frame-selection based approach for developing speech-signal-based frequency warping scale. Then, we propose a new method for computing the filter frequency responses by using principal component analysis (PCA). The main advantage of… CONTINUE READING
    2 Citations

    References

    SHOWING 1-10 OF 90 REFERENCES
    A novel approach in feature level for robust text-independent speaker identification system
    • S. K. Sarangi, G. Saha
    • Computer Science
    • 2012 4th International Conference on Intelligent Human Computer Interaction (IHCI)
    • 2012
    • 14
    Data-driven spectral basis functions for automatic speech recognition
    • 23
    Optimization of temporal filters for constructing robust features in speech recognition
    • Jeih-Weih Hung, L. Lee
    • Mathematics, Computer Science
    • IEEE Transactions on Audio, Speech, and Language Processing
    • 2006
    • 66
    • PDF
    Mean Hilbert envelope coefficients (MHEC) for robust speaker and language identification
    • 47
    Power-Normalized Cepstral Coefficients (PNCC) for Robust Speech Recognition
    • Chanwoo Kim, R. Stern
    • Computer Science
    • IEEE/ACM Transactions on Audio, Speech, and Language Processing
    • 2016
    • 315
    • PDF
    Data Driven Design of Filter Bank for Speech Recognition
    • 19
    A perceptually-motivated low-complexity instantaneous linear channel normalization technique applied to speaker verification
    • 10
    • PDF
    Data-Driven Temporal Filters and Alternatives to GMM in Speaker Verification
    • 30
    • PDF