A speech coder based on decomposition of characteristic waveforms

@article{Kleijn1995ASC,
  title={A speech coder based on decomposition of characteristic waveforms},
  author={W. Kleijn and Jesper Haagen},
  journal={1995 International Conference on Acoustics, Speech, and Signal Processing},
  year={1995},
  volume={1},
  pages={508-511 vol.1}
}
  • W. Kleijn, J. Haagen
  • Published 9 May 1995
  • Computer Science
  • 1995 International Conference on Acoustics, Speech, and Signal Processing
For low-rate speech coding it is advantageous to represent the speech signal as an evolving characteristic waveform (CW). The CW evolves slowly when the speech signal is clearly voiced and rapidly when the speech signal is clearly unvoiced. The voiced (periodic) and unvoiced (nonperiodic) components of the speech signal can be separated by a simple nonadaptive filter in the CW domain. Because of perceptual effects, a significant increase in coding efficiency is obtained by coding these two… 

Figures and Tables from this paper

Multiband prototype waveform analysis synthesis for very low bit rate speech coding

  • K. YaghmaieA. Kondoz
  • Computer Science
    1997 IEEE International Conference on Acoustics, Speech, and Signal Processing
  • 1997
TLDR
This paper presents a novel combination of the multiband voicing analysis and PWI coding system in which theMultiband analysis is exploited to identify the voiced and unvoiced spectral components of the prototype waveforms of the original speech signal.

A new 1.2 kbit/s speech coding method based on a sinusoidal harmonic vocoder

  • M. Kohata
  • Physics
    Systems and Computers in Japan
  • 2000
A very low bit rate speech coder at 1.2 kbit/s is proposed in which a speech signal is synthesized by using sinusoids whose frequencies are multiples of the fundamental frequency, and whose

Harmonic-stochastic excitation (HSX) speech coding below 4 kbit/s

This paper presents an algorithm for encoding speech signals at bit rates below 4 kbit/s based on a mixed harmonic and stochastic modeling of the excitation signal. The algorithm uses robust pitch

Source controlled variable bit-rate speech coder based on waveform interpolation

TLDR
The SC-VBR coder was evaluated against a fixed rate 4.6kbit/s PWI coder for clean speech and noisy speech and was found to perform better for male speech and for noisy speech.

Smoothing the Evolution of the Spectral Parameters in Speech Coders

New generation of speech coders have to achieve two goals: efficient use of bandwidth and high speech quality. The objective of this thesis is to improve the modelling of speech signal within the

Efficient mixed excitation models in LPC based prototype interpolation speech coders

  • C. PapanastasiouC. Xydeas
  • Computer Science
    1997 IEEE International Conference on Acoustics, Speech, and Signal Processing
  • 1997
TLDR
A new and efficient method for modeling voiced, mixed excitation spectra in sinusoidal (SC) and prototype interpolation coding (PIC) systems that allows the mixing of periodic and random signal energy on a harmonic basis is presented.

A pitch pulse evolution model for linear predictive coding of speech

TLDR
A new speech compression technique designed for near toll quality speech coding at bit rates as low as 4 kb/s is presented, and a robust algorithm for extracting noisy pitch pulses from the LP residual based on error minimization with respect to a set of model pulses is developed.

A 2.4 Kbps Waveform Interpolation Speech Codec Incorporating Wavelet-based Techniques

TLDR
A wavelet-based pitch detector is invoked, which substantially reduces the complexity of conventional autocorrelation- based pitch detectors, while ensuring smooth pitch traject-ory evolution, and is incorporated in a waveform-interpolated codec.

1.2 kbit/s harmonic coder using auditory filters

  • M. Kohata
  • Computer Science
    1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258)
  • 1999
TLDR
A very low bit speech coder at 1.2 kbps, like the LPC vocoder, it only requires gain, pitch, and spectral information, but its quality is far superior, and it was comparable to that of 2.4 kbit/s MELP coder.
...

References

SHOWING 1-10 OF 14 REFERENCES

Encoding speech using prototype waveforms

  • W. Kleijn
  • Engineering
    IEEE Trans. Speech Audio Process.
  • 1993
TLDR
The coding method is easily combined with existing LP-based speech coders, such as CELP, for unvoiced signals and excellent voiced speech quality is obtained at rates between 3.0 and 4.0 kb/s.

Continuous representations in linear predictive coding

  • W. Kleijn
  • Computer Science
    [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing
  • 1991
TLDR
A novel method of coding voiced speech is introduced, which transmits an encoded prototype waveform at 20-30 ms intervals, and is quantized using analysis-by-synthesis methods, which results in excellent speech quality at rates between 3.0 and 4.0 kb/s.

Improvements in 2.4 kbps high-quality speech coding

TLDR
An algorithm for 2.4 kb/s speech coding is described, which results in a better compromise between bit allocation for short-term quantization and residual coding and an improved high-frequency regeneration.

Transformation and decomposition of the speech signal for coding

TLDR
The decomposition of the characteristic waveform is decomposed into a slowly evolving waveform and a rapidly evolving waveforms, representing the quasi-periodic and other components of speech, respectively, which allows efficient coding of voiced and unvoiced speech at bit rates between 2 and 8 kb/s.

A mixed prototype waveform/CELP coder for sub 3 kbit/s

  • I. BurnettR. Holbeche
  • Computer Science, Business
    1993 IEEE International Conference on Acoustics, Speech, and Signal Processing
  • 1993
TLDR
The authors address problems by using a prototype waveform coder for voiced frames while retaining a CELP algorithm for unvoiced frames for sub-3 kbit/s coding and two coding schemes for the prototypes are discussed.

High-quality speech coding at 2.4 to 4.0 kbit/s based on time-frequency interpolation

  • Y. Shoham
  • Computer Science, Business
    1993 IEEE International Conference on Acoustics, Speech, and Signal Processing
  • 1993
The author presents a novel algorithm for high-quality coding and demonstrates the advantage of the proposed coder over the conventional CELP (code-excited linear prediction) algorithm for low rate

Real-time vector APC speech coding at 4800 bps with adaptive postfiltering

  • Juin-Hwey ChenA. Gersho
  • Computer Science
    ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing
  • 1987
An improved Vector APC (VAPC) speech coder at 4800 bps produces speech with very good communications quality while maintaining a complexity low enough to allow a real-time implementation with at most

Low-bit-rate speech coding using a two-dimensional transform of residual signals and waveform interpolation

  • Yoshinori TanakaH. Kimura
  • Business
    Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing
  • 1994
TLDR
Two efficient excitation coding schemes that use a pitch synchronous two-dimensional Fourier transform of linear prediction residual signals and prototype waveform interpolation (PWI) coding using smoothed prototype waveforms are proposed.

Performance of noise excitation for unvoiced speech

  • G. KubinB. AtalW. Kleijn
  • Physics, Engineering
    Proceedings., IEEE Workshop on Speech Coding for Telecommunications,
  • 1993
TLDR
This paper addresses the question what perceptual quality can be achieved for unvoiced speech by a linear model with white noise excitation and demonstrates that this linear model results in unvoicing speech of high perceptual quality.

Synthesis of breathy vowels: Some research methods