Singing Pitch Extraction by Voice Vibrato / Tremolo Estimation and Instrument Partial Deletion
@inproceedings{Hsu2010SingingPE, title={Singing Pitch Extraction by Voice Vibrato / Tremolo Estimation and Instrument Partial Deletion}, author={Chao-Ling Hsu and Jyh-Shing Roger Jang}, booktitle={International Society for Music Information Retrieval Conference}, year={2010} }
This paper proposes a novel and effective approach to extract the pitches of the singing voice from monaural polyphonic songs. The sinusoidal partials of the musical audio signals are first extracted. The Fourier transform is then applied to extract the vibrato/tremolo information of each partial. Some criteria based on this vibrato/tremolo information are employed to discriminate the vocal partials from the music accompaniment partials. Besides, a singing pitch trend estimation algorithm which…
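The abstract's second and third steps (a Fourier transform over each partial, then criteria on the resulting vibrato/tremolo information) can be illustrated with a minimal sketch. This is not the authors' implementation. The function names, the 4 to 8 Hz search band, and the extent threshold are hypothetical choices made for illustration, and the partial's frequency trajectory is assumed to have already been extracted at a fixed frame rate.

```python
import numpy as np


def modulation_features(track, frame_rate_hz, band_hz=(4.0, 8.0)):
    """Estimate modulation rate and extent of one partial's trajectory.

    `track` holds one value per analysis frame: frequency in Hz for
    vibrato, or amplitude for tremolo. `band_hz` is an assumed search
    band for the modulation rate, not a value taken from the paper.
    """
    track = np.asarray(track, dtype=float)
    if track.size < 4:
        raise ValueError("need at least 4 frames to analyse modulation")

    # Remove the mean so the constant offset does not dominate the spectrum.
    centered = track - track.mean()
    window = np.hanning(track.size)
    spectrum = np.abs(np.fft.rfft(centered * window))
    mod_freqs = np.fft.rfftfreq(track.size, d=1.0 / frame_rate_hz)

    # Look for the strongest modulation component inside the search band.
    in_band = np.flatnonzero((mod_freqs >= band_hz[0]) & (mod_freqs <= band_hz[1]))
    if in_band.size == 0:
        return 0.0, 0.0
    peak = in_band[np.argmax(spectrum[in_band])]

    rate_hz = float(mod_freqs[peak])
    # Convert the windowed FFT magnitude back to a sinusoid amplitude.
    extent = float(2.0 * spectrum[peak] / window.sum())
    return rate_hz, extent


def looks_vocal(extent, min_extent=2.0):
    """Hypothetical criterion: keep partials whose modulation is deep enough."""
    return extent >= min_extent


# Synthetic example: a partial near 440 Hz with 6 Hz vibrato of +/- 5 Hz,
# and a steady partial such as a sustained instrument note might produce.
frame_rate = 100.0
t = np.arange(100) / frame_rate
vocal_like = 440.0 + 5.0 * np.sin(2.0 * np.pi * 6.0 * t)
steady = np.full_like(t, 440.0)

vocal_rate, vocal_extent = modulation_features(vocal_like, frame_rate)
steady_rate, steady_extent = modulation_features(steady, frame_rate)
```

Passing an amplitude trajectory instead of a frequency trajectory gives the corresponding tremolo estimate. In practice the threshold would have to be tuned on labelled data.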
42 Citations
Efficient Vocal Melody Extraction from Polyphonic Music Signals
- Computer Science
- 2013
The quantitative evaluation shows that the proposed system for automatic vocal melody extraction from polyphonic music recordings not only keeps the overall accuracy compared with the state-of-the-art approaches submitted to MIREX, but also achieves high algorithm efficiency.
Singing Voice Separation and Pitch Extraction from Monaural Polyphonic Audio Music via DNN and Adaptive Pitch Tracking
- Computer Science
- 2016 IEEE Second International Conference on Multimedia Big Data (BigMM)
- 2016
A novel and effective two-stage approach to singing pitch extraction, which involves singing voice separation and pitch tracking for monaural polyphonic audio music, that outperforms a previous state-of-the-art approach in raw-pitch accuracy.
Singing Voice Separation and Vocal F0 Estimation Based on Mutual Combination of Robust Principal Component Analysis and Subharmonic Summation
- Engineering
- IEEE/ACM Transactions on Audio, Speech, and Language Processing
- 2016
A new method of singing voice analysis that performs mutually dependent singing voice separation and vocal fundamental frequency (F0) estimation, and that outperformed all the other singing voice separation methods submitted to an international music analysis competition called MIREX 2014.
A trend estimation algorithm for singing pitch detection in musical recordings
- Computer Science
- 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- 2011
A trend estimation algorithm to detect the pitch ranges of a singing voice in each time frame is proposed and substantially reduces the difficulty of singing pitch detection by reducing a large number of wrong pitch candidates either produced by musical instruments or the overtones of the singing voice.
Towards Solving the Bottleneck of Pitch-based Singing Voice Separation
- Computer Science
- ACM Multimedia
- 2015
Two novel methods based on non-negative matrix factorization (NMF) are devised, which outperform other state-of-the-art singing separation algorithms and improve the accuracy of vocal pitch detection.
Singing voice separation using mono-channel mask
- Computer Science
- International Journal of Speech Technology
- 2018
This work proposes a three stage system for singing voice separation which helps to improve intelligibility and perceptual quality of the separated output and observes that the singing voice separated using mono-channel mask improves the GNSDR score.
Singing voice analysis and editing based on mutually dependent F0 estimation and source separation
- Computer Science
- 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
- 2015
A novel framework is presented that improves both vocal fundamental frequency (F0) estimation and singing voice separation by exploiting the mutual dependency of the two tasks, combining a time-frequency mask based on RPCA with a mask based on harmonic structures.
Melody Extraction From Polyphonic Music Signals Using Pitch Contour Characteristics
- Computer Science
- IEEE Transactions on Audio, Speech, and Language Processing
- 2012
A comparative evaluation of the proposed approach shows that it outperforms current state-of-the-art melody extraction systems in terms of overall accuracy.
Predominant Melody Extraction from Vocal Polyphonic Music Signal by Time-Domain Adaptive Filtering-Based Method
- Engineering
- Circuits, Systems, and Signal Processing
- 2017
In this paper, a time-domain adaptive filtering-based melody extraction method is proposed. The proposed method works in multiple stages to extract the vocal melody (singer’s fundamental frequency)…
References
Singing Pitch Extraction from Monaural Polyphonic Songs by Contextual Audio Modeling and Singing Harmonic Enhancement
- Engineering
- ISMIR
- 2009
This paper proposes a novel approach to extract the pitches of singing voices from monaural polyphonic songs. The hidden Markov model (HMM) is adopted to model the transition between adjacent singing…
Singing voice detection in music tracks using direct voice vibrato detection
- Computer Science
- 2009 IEEE International Conference on Acoustics, Speech and Signal Processing
- 2009
The proposed method achieves results very close to those of the machine learning approach: 76.8% compared to 77.4% F-measure (frame classification).
Detecting pitch of singing voice in polyphonic audio
- Computer Science
- Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005.
- 2005
A robust algorithm to detect the pitch of a singing voice in polyphonic audio is proposed and an HMM is employed to integrate the periodicity information across frequency channels and time frames.
Measurements of the vibrato rate of ten singers
- Physics
- 1994
The vibrato rate for ten singers, all singing Schubert’s Ave Maria, was measured on sonograms. Commercially available CD records were used to ensure that the vibrato originated in a real musical…
Melody Transcription From Music Audio: Approaches and Evaluation
- Computer Science
- IEEE Transactions on Audio, Speech, and Language Processing
- 2007
The results of full-scale evaluations of melody transcription systems conducted in 2004 and 2005 are described, including an overview of the systems submitted, details of how the evaluations were conducted, and a discussion of the results.
Measurement of pitch by subharmonic summation.
- Mathematics
- The Journal of the Acoustical Society of America
- 1988
It is argued that the favorable performance of the subharmonic-summation algorithm stems from its corresponding more closely with current pitch-perception theories than does the harmonic sieve.
SINUSOIDAL EXTRACTION USING AN EFFICIENT IMPLEMENTATION OF A MULTI-RESOLUTION FFT
- Computer Science
- 2006
A detailed description of the spectral analysis front-end of a melody extraction algorithm that includes a novel technique for the efficient computation of STFT spectra in different time-frequency resolutions is provided.
Perceptual Evaluation of Vibrato Models
- Physics
- 2005
We promote a clearer definition of vibrato (Seashore, 1932), based on a review of various vibrato features. We also propose a generalised vibrato effect generator that includes spectral envelope…
On the Improvement of Singing Voice Separation for Monaural Recordings Using the MIR-1K Dataset
- Computer Science
- IEEE Transactions on Audio, Speech, and Language Processing
- 2010
This paper has constructed a corpus called MIR-1K (multimedia information retrieval lab, 1000 song clips), where all singing voices and music accompaniments were recorded separately, and enhanced the performance of separating voiced singing via a spectral subtraction method.