Classification of vocal and non-vocal regions from audio songs using spectral features and pitch variations

@inproceedings{Murthy2015ClassificationOV,
  title={Classification of vocal and non-vocal regions from audio songs using spectral features and pitch variations},
  author={Y. V. Srinivasa Murthy and Shashidhar G. Koolagudi},
  booktitle={2015 IEEE 28th Canadian Conference on Electrical and Computer Engineering (CCECE)},
  year={2015},
  pages={1271--1276}
}
In this work, an effort has been made to identify vocal and non-vocal regions in a given song using signal processing techniques and a machine learning algorithm. Initially, spectral features such as mel-frequency cepstral coefficients (MFCCs) are used to develop the baseline system. Statistical values of pitch, jitter and shimmer are then considered to improve the performance of the system. Artificial neural networks (ANNs) are used to capture the characteristics of vocal and non-vocal segments of the…
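The pitch-derived features named in the abstract (statistics of pitch, plus jitter and shimmer) can be illustrated with a minimal Python sketch. It assumes the commonly used "local" definitions, where jitter is the mean absolute difference between consecutive pitch periods divided by the mean period, and shimmer is the same ratio computed on per-cycle peak amplitudes. The abstract shown here does not state which definitions the paper uses, so the function names (`relative_perturbation`, `jitter_local`, `shimmer_local`, `pitch_statistics`) and the choice of statistics are illustrative assumptions, not the authors' method.

```python
from statistics import mean, pstdev


def relative_perturbation(values):
    """Mean absolute difference between consecutive values, divided by the mean value."""
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    return mean(diffs) / mean(values)


def jitter_local(periods_s):
    """Cycle-to-cycle variation of pitch periods (seconds), as a fraction of the mean period."""
    return relative_perturbation(periods_s)


def shimmer_local(peak_amplitudes):
    """Cycle-to-cycle variation of per-cycle peak amplitudes, as a fraction of the mean amplitude."""
    return relative_perturbation(peak_amplitudes)


def pitch_statistics(periods_s):
    """Summary statistics of fundamental frequency (Hz) over a segment (assumed feature set)."""
    f0 = [1.0 / p for p in periods_s]
    return {
        "mean_hz": mean(f0),
        "std_hz": pstdev(f0),
        "min_hz": min(f0),
        "max_hz": max(f0),
    }


if __name__ == "__main__":
    # Hypothetical pitch periods and amplitudes for one voiced segment.
    periods = [0.0100, 0.0110, 0.0100, 0.0110]
    amplitudes = [0.80, 0.82, 0.79, 0.81]
    features = pitch_statistics(periods)
    features["jitter"] = jitter_local(periods)
    features["shimmer"] = shimmer_local(amplitudes)
    print(features)
```

In a pipeline like the one the abstract outlines, a per-segment feature vector of this kind would be concatenated with the MFCC features before being passed to the classifier. Extracting the pitch periods and per-cycle amplitudes from audio is a separate step not shown here.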
This paper has 17 citations.

Citations

Publications citing this paper.
Showing 1-7 of 7 extracted citations

Vocal and Non-vocal Segmentation based on the Analysis of Formant Structure

2017 Ninth International Conference on Advances in Pattern Recognition (ICAPR) • 2017

Content-based audio classification and retrieval: A novel approach

2016 International Conference on Global Trends in Signal Processing, Information Computing and Communication (ICGTSPICC) • 2016

Detection of largest possible repeated patterns in Indian audio songs using spectral features

2016 IEEE Canadian Conference on Electrical and Computer Engineering (CCECE) • 2016

References

Publications referenced by this paper.
Showing 1-10 of 38 references

An introduction to audio content analysis: Applications in signal processing and music informatics

Alexander Lerch
2012

Automatic singer identification based on auditory features

2011 Seventh International Conference on Natural Computation • 2011

Model-based Classification of Speech Audio

Chris Thoman
ProQuest • 2009

Singing voice detection in music tracks using direct voice vibrato detection

2009 IEEE International Conference on Acoustics, Speech and Signal Processing • 2009

A Regression Approach to Music Emotion Recognition

IEEE Transactions on Audio, Speech, and Language Processing • 2008