Analysis of an MFCC-based audio indexing system for efficient coding of multimedia sources

@article{Mubarak2005AnalysisOA,
  title={Analysis of an MFCC-based audio indexing system for efficient coding of multimedia sources},
  author={Omer Mohsin Mubarak and Eliathamby Ambikairajah and Julien Epps},
  journal={Proceedings of the Eighth International Symposium on Signal Processing and Its Applications, 2005.},
  year={2005},
  volume={2},
  pages={619-622}
}
Discrimination between speech and music signals is an important problem in efficient digital radio broadcasting, particularly for variable bit rate applications such as Internet radio. This paper presents a speech/music discrimination system based on a Mel frequency cepstral coefficient (MFCC) front end and a GMM classifier. This system can be used to select the optimum coding scheme for the current frame of an input signal without knowing a priori whether it contains speech-like or music-like… CONTINUE READING

Figures, Results, and Topics from this paper.

Key Quantitative Results

  • For the 46 minute evaluation database used in this experiment, an accuracy of up to 97.14% for music and 93.87% for speech can be attained.

Explore Further: Topics Discussed in This Paper

Similar Papers

Loading similar papers…