• Corpus ID: 15097078

Singing Pitch Extraction by Voice Vibrato / Tremolo Estimation and Instrument Partial Deletion

@inproceedings{Hsu2010SingingPE,
  title={Singing Pitch Extraction by Voice Vibrato / Tremolo Estimation and Instrument Partial Deletion},
  author={Chao-Ling Hsu and Jyh-Shing Roger Jang},
  booktitle={International Society for Music Information Retrieval Conference},
  year={2010}
}
This paper proposes a novel and effective approach to extract the pitches of the singing voice from monaural polyphonic songs. The sinusoidal partials of the musical audio signals are first extracted. The Fourier transform is then applied to extract the vibrato/tremolo information of each partial. Some criteria based on this vibrato/tremolo information are employed to discriminate the vocal partials from the music accompaniment partials. Besides, a singing pitch trend estimation algorithm which… 
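
The vibrato-analysis step described in the abstract can be illustrated with a short Python sketch. It is not the authors' implementation: it assumes the frequency trajectory of one sinusoidal partial is already available as an array, and the names `vibrato_profile` and `looks_vocal`, the 4 to 8 Hz modulation band, and the 1 Hz extent threshold are all hypothetical choices made only for illustration. The sketch applies a Fourier transform to the trajectory and reads off the rate and extent of the strongest modulation.

```python
import numpy as np


def vibrato_profile(freq_track_hz, frame_rate_hz, band_hz=(4.0, 8.0)):
    """Estimate (rate_hz, extent_hz) of the strongest frequency modulation
    of one partial inside band_hz. The band limits are an assumption."""
    track = np.asarray(freq_track_hz, dtype=float)
    centered = track - track.mean()          # remove the partial's mean frequency
    window = np.hanning(len(centered))       # taper to limit spectral leakage
    spectrum = np.abs(np.fft.rfft(centered * window))
    mod_freqs = np.fft.rfftfreq(len(centered), d=1.0 / frame_rate_hz)

    in_band = (mod_freqs >= band_hz[0]) & (mod_freqs <= band_hz[1])
    if not in_band.any():
        return 0.0, 0.0

    # Strongest modulation component inside the band.
    peak = int(np.argmax(np.where(in_band, spectrum, 0.0)))
    # A windowed sinusoid of amplitude A peaks at roughly A * sum(window) / 2.
    extent_hz = 2.0 * spectrum[peak] / window.sum()
    return float(mod_freqs[peak]), float(extent_hz)


def looks_vocal(freq_track_hz, frame_rate_hz, min_extent_hz=1.0):
    """Hypothetical decision rule: treat a partial as vocal when its
    in-band modulation extent exceeds min_extent_hz."""
    _, extent_hz = vibrato_profile(freq_track_hz, frame_rate_hz)
    return extent_hz > min_extent_hz


# Synthetic example: one partial with a 6 Hz, +/-5 Hz vibrato and one steady partial.
frame_rate = 100.0                            # trajectory frames per second
t = np.arange(100) / frame_rate               # one second of frames
vocal_partial = 440.0 + 5.0 * np.sin(2 * np.pi * 6.0 * t)
steady_partial = np.full_like(t, 440.0)

print(vibrato_profile(vocal_partial, frame_rate))
print(looks_vocal(vocal_partial, frame_rate), looks_vocal(steady_partial, frame_rate))
```

A tremolo (amplitude modulation) estimate could be obtained the same way by passing the partial's amplitude trajectory instead of its frequency trajectory; how the paper actually combines the two cues is not stated in the truncated abstract.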

Efficient Vocal Melody Extraction from Polyphonic Music Signals

The quantitative evaluation shows that the proposed system for automatic vocal melody extraction from polyphonic music recordings not only maintains overall accuracy comparable to the state-of-the-art approaches submitted to MIREX, but also achieves high algorithmic efficiency.

Singing Voice Separation and Pitch Extraction from Monaural Polyphonic Audio Music via DNN and Adaptive Pitch Tracking

A novel and effective two-stage approach to singing pitch extraction, which involves singing voice separation and pitch tracking for monaural polyphonic audio music, that outperforms a previous state-of-the-art approach in raw-pitch accuracy.

Singing Voice Separation and Vocal F0 Estimation Based on Mutual Combination of Robust Principal Component Analysis and Subharmonic Summation

A new method of singing voice analysis that performs mutually dependent singing voice separation and vocal fundamental frequency (F0) estimation, and that outperformed all the other methods of singing voice separation submitted to an international music analysis competition called MIREX 2014.

A trend estimation algorithm for singing pitch detection in musical recordings

A trend estimation algorithm to detect the pitch ranges of a singing voice in each time frame is proposed; it substantially reduces the difficulty of singing pitch detection by eliminating a large number of wrong pitch candidates produced either by musical instruments or by the overtones of the singing voice.

Towards Solving the Bottleneck of Pitch-based Singing Voice Separation

Two novel methods based on non-negative matrix factorization (NMF) are devised, which outperform other state-of-the-art singing separation algorithms and improve the accuracy of vocal pitch detection.

Singing voice separation using mono-channel mask

This work proposes a three-stage system for singing voice separation that helps to improve the intelligibility and perceptual quality of the separated output, and observes that separating the singing voice with a mono-channel mask improves the GNSDR score.

Singing voice analysis and editing based on mutually dependent F0 estimation and source separation

A novel framework is presented that improves both vocal fundamental frequency (F0) estimation and singing voice separation by exploiting the mutual dependency of the two tasks, combining a time-frequency mask based on RPCA with a mask based on harmonic structures.

Melody Extraction From Polyphonic Music Signals Using Pitch Contour Characteristics

  • J. Salamon, E. Gómez
  • Computer Science
    IEEE Transactions on Audio, Speech, and Language Processing
  • 2012
A comparative evaluation of the proposed approach shows that it outperforms current state-of-the-art melody extraction systems in terms of overall accuracy.

Predominant Melody Extraction from Vocal Polyphonic Music Signal by Time-Domain Adaptive Filtering-Based Method

In this paper, a time-domain adaptive filtering-based melody extraction method is proposed. The proposed method works in multiple stages to extract the vocal melody (singer’s fundamental frequency)

References

Singing Pitch Extraction from Monaural Polyphonic Songs by Contextual Audio Modeling and Singing Harmonic Enhancement

This paper proposes a novel approach to extract the pitches of singing voices from monaural polyphonic songs. The hidden Markov model (HMM) is adopted to model the transition between adjacent singing

Singing voice detection in music tracks using direct voice vibrato detection

  • L. Regnier, G. Peeters
  • Computer Science
    2009 IEEE International Conference on Acoustics, Speech and Signal Processing
  • 2009
The proposed method achieves results very close to those of the machine learning approach: 76.8% compared to 77.4% F-measure (frame classification).

Detecting pitch of singing voice in polyphonic audio

  • Yipeng Li, Deliang Wang
  • Computer Science
    Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005.
  • 2005
A robust algorithm to detect the pitch of a singing voice in polyphonic audio is proposed and an HMM is employed to integrate the periodicity information across frequency channels and time frames.

Measurements of the vibrato rate of ten singers

The vibrato rate for ten singers, all singing Schubert’s Ave Maria, was measured on sonograms. Commercially available CD records were used to insure that the vibrato originated in a real musical

Melody Transcription From Music Audio: Approaches and Evaluation

The results of full-scale evaluations of melody transcription systems conducted in 2004 and 2005 are described, including an overview of the systems submitted, details of how the evaluations were conducted, and a discussion of the results.

Measurement of pitch by subharmonic summation.

  • D. J. Hermes
  • Mathematics
    The Journal of the Acoustical Society of America
  • 1988
It is argued that the favorable performance of the subharmonic-summation algorithm stems from its corresponding more closely with current pitch-perception theories than does the harmonic sieve.

SINUSOIDAL EXTRACTION USING AN EFFICIENT IMPLEMENTATION OF A MULTI-RESOLUTION FFT

A detailed description of the spectral analysis front-end of a melody extraction algorithm that includes a novel technique for the efficient computation of STFT spectra in different time-frequency resolutions is provided.

Perceptual Evaluation of Vibrato Models

We promote a clearer definition of vibrato (Seashore, 1932), based on a review of various vibrato features. We also propose a generalised vibrato effect generator that includes spectral envelope

On the Improvement of Singing Voice Separation for Monaural Recordings Using the MIR-1K Dataset

  • Chao-Ling Hsu, J. Jang
  • Computer Science
    IEEE Transactions on Audio, Speech, and Language Processing
  • 2010
This paper has constructed a corpus called MIR-1K (multimedia information retrieval lab, 1000 song clips), where all singing voices and music accompaniments were recorded separately, and enhanced the performance of separating voiced singing via a spectral subtraction method.