Speech analysis/Synthesis based on a sinusoidal representation
@article{McAulay1986SpeechAB,
title={Speech analysis/Synthesis based on a sinusoidal representation},
author={Robert J. McAulay and Thomas F. Quatieri},
journal={IEEE Trans. Acoust. Speech Signal Process.},
year={1986},
volume={34},
pages={744-754}
}A sinusoidal model for the speech waveform is used to develop a new analysis/synthesis technique that is characterized by the amplitudes, frequencies, and phases of the component sine waves. [] Key Method For a given frequency track a cubic function is used to unwrap and interpolate the phase such that the phase track is maximally smooth. This phase function is applied to a sine-wave generator, which is amplitude modulated and added to the other sine waves to give the final speech output.
1,939 Citations
Speech Processing Based on a Sinusoidal Model
- Physics
Using a sinusoidal model of speech, an analysis/synthesis technique has been developed that characterizes speech in terms of the amplitudes, frequencies, and phases of the component sine waves. These…
Sine-Wave Amplitude Coding at Low Data Rates
- Physics
- 1991
An analysis/synthesis system based on the sinusoidal speech model has been developed [1]. In that system, the sine-wave amplitudes and frequencies are located by searching for the peaks of the…
Mixed-phase deconvolution of speech based on a sine-wave model
- EngineeringICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing
- 1987
Speech modification with a mixed-phase system estimate is shown to be capable of more closely preserving waveform shape in time-scale and pitch transformations than the earlier approach.
Speech transformations based on a sinusoidal representation
- PhysicsIEEE Trans. Acoust. Speech Signal Process.
- 1986
A new speech analysis/synthesis technique is presented which provides the basis for a general class of speech transformations including time-scale modification, frequency scaling, and pitch modification, based on a sinusoidal representation of the speech production mechanism.
Speech enhancement based on a sinusoidal model.
- PhysicsJournal of speech and hearing research
- 1994
Reducing the number of sinusoids used to represent the speech causes reduced consonant recognition and perceived intelligibility both in quiet and in noise, and suggests that similar results would be expected for listeners with hearing impairments.
An Analysis/Synthesis System of Audio Signal with Utilization of an SN Model
- Physics
- 2004
An SN (sinusoids plus noise) model is a spectral model, in which the periodic components of the sound are represented by sinusoids with time-varying frequencies, amplitudes and phases. The remaining…
SPEECH SYNTHESIS BASED ON SINUSOIDAL MODELING
- Physics
- 2004
This report presents an introduction to speech synthesis with a brief overview of some methods and their associated problems. The usage of sinusoidal representation of speech waveform for producing…
Speech analysis/Synthesis based on matching the synthesized and the original representations in the auditory nerve level
- PhysicsICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing
- 1986
The results demonstrate the adequacy of the in-synchrony-bands measure in selecting the perceptually meaningful frequency regions of the stimulus spectra and significantly reduces the number of sinusoidal components needed for synthesis by approximately 70 percent, offering the potential for reduced data-rate.
An approach to co-channel talker interference suppression using a sinusoidal model for speech
- GeologyICASSP-88., International Conference on Acoustics, Speech, and Signal Processing
- 1988
Evidence is provided that the sinusoidal analysis/synthesis model with effective parameter estimation techniques offers a promising approach to the problem of cochannel talker-interference suppression over a range of conditions.
Estimation of sinusoids in audio signals using an analysis-by-synthesis neural network
- Physics2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221)
- 2001
An analysis-by-synthesis system of neural networks is used to extract the sinusoidal parameters from the signal spectrum at each window position of the short-term Fourier transform, finding the set of sinusoids that best fits the spectral representation in a least-squares sense.
References
SHOWING 1-9 OF 9 REFERENCES
Mid-rate coding based on a sinusoidal representation of speech
- Computer ScienceICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing
- 1985
In this paper a sinusoidal model for the speech waveform is used to develop a new analysis/synthesis technique that is characterized by the amplitudes, frequencies, and phases of the component sine…
Magnitude-only reconstruction using a sinusoidal speech modelMagnitude-only reconstruction using a sinusoidal speech model
- Physics, GeologyICASSP
- 1984
A sinusoidal model for the speech waveform is used to develop a new synthesis technique that requires specification of only the amplitudes and frequencies of the component sine waves, and preserves the short-time spectral magnitude during rapid movements of spectral energy.
A new model of LPC excitation for producing natural-sounding speech at low bit rates
- PhysicsICASSP
- 1982
This paper describes a new approach to the excitation problem that does not require a priori knowledge of either the voiced-unvoiced decision or the pitch period, and minimizes a perceptual-distance metric representing subjectively-important differences between the waveforms of the original and the synthetic speech signals.
Variable-frequency synthesis: An improved harmonic coding scheme
- Computer ScienceICASSP
- 1984
Harmonic Coding is synthesized in the time domain, as a superimposition of "harmonics" whose instantaneous frequency varies continuously along an interpolation curve, within each frame, so that fast pitch variations can be tracked with no difficulty.
Computer studies on parametric coding of speech spectra
- Physics
- 1980
We report a series of computer experiments aimed to increase our understanding about the sufficiency of the short‐time amplitude spectrum for speech coding, and to examine how bandpass segments of…
Parametric coding of speech spectra
- Physics
- 1980
We suggest that a class of speech coders can be designed from criteria that are perception‐specific. Such a class represents a middle ground between ’’waveform’’ coders and speech‐specific ’’source’’…
A tone oriented voice excited vocoder
- Physics, Computer ScienceICASSP
- 1981
An LPC base-band vocoder is developed and experiments have shown the coder to be robust to background noise and implementation aspects as well as simulation results are discussed.
Speech transformations based on a sinusoidal representation
- PhysicsICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing
- 1985
A new speech analysis/synthesis technique is presented which provides the basis for a general class of speech transformations including time-scale modification, frequency scaling, and pitch modification, based on a sinusoidal representation of the speech production mechanism.







