A low-complexity waveform interpolation coder

  title={A low-complexity waveform interpolation coder},
  author={W. Kleijn and Yair Shoham and Deep Sen and Roar Hagen},
  journal={1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings},
  pages={212-215 vol. 1}
  • W. Kleijn, Y. Shoham, R. Hagen
  • Published 7 May 1996
  • Computer Science
  • 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings
A recent independent survey found a 2.4 kbit/s waveform-interpolation (WI) algorithm to perform better than other state-of-the-art speech coders. However, this coder had a very high level of computational complexity. The introduction of various techniques, including a time-varying waveform sampling rate and a cubic B-spline waveform representation, has reduced the computational complexity by an order of magnitude. The new implementation allows full-duplex real-time operation on a single DSP… 

Figures and Tables from this paper

A Low-complexity Improved WI Speech Coding at 2 kbps

A low-complexity improved waveform interpolation speech coder at 2kbps is proposed and has greatly reduced the computational complexity and improved the reconstructed speech quality by using various techniques, including FFT, cubic B-spline interpolation, nonlinear frequency resolution and so on.

A 2.4kbps Multiband Characteristic Waveform Interpolation Speech Coding Algorithm

The result shows that the new algorithm can rebuild the speech high frequency harmonics perfectly and is obviously superior to the conventional CWI coder and also slightly exceeds the MELP coder.

Enhanced waveform interpolative coding at low bit-rate

This paper presents a high quality enhanced waveform interpolative (EWI) speech coder at low bit-rate. The system incorporates novel features such as optimization of the slowly evolving waveform

2 kbps-2.4 kbps low complexity interpolative vocoder

  • Jiang HaoCui Hui-juanTang Kun
  • Computer Science
    ICCT'98. 1998 International Conference on Communication Technology. Proceedings (IEEE Cat. No.98EX243)
  • 1998
This speech coder takes the good ideal of waveform interpolation, but adopts a different framework to simplify the system complexity and presents some important techniques such as one-frame look ahead pitch smoothing algorithm and parameter quantization.

A Simplified Algorithm for Representation and Alignment of the Characteristic Waveform in WI Speech Coding

  • Fengyan QiC. Bao
  • Computer Science
    2006 8th international Conference on Signal Processing
  • 2006
The improved algorithm for CW representation and CW alignment was proposed by using FFT, cubic B-spline interpolation and period extension and results show that the improved algorithm has greatly reduced the computational complexity and kept the same speech quality as original coder.

Multi-prototype waveform coding using frame-by-frame analysis-by-synthesis

  • I. BurnettDuong H. Pham
  • Computer Science
    1997 IEEE International Conference on Acoustics, Speech, and Signal Processing
  • 1997
In the new scheme, a frame of prototype waveforms is quantised using two sets of codebook searches, one representing the slowly evolving prototype shape and the other the rapid, noisy components, resulting in a generalised analysis-by-synthesis waveform interpolation architecture with closed-loop optimisation of all prototype waveform properties.

Enhancing waveform interpolative coding with weighted REW parametric quantization

  • O. GottesmanA. Gersho
  • Computer Science
    2000 IEEE Workshop on Speech Coding. Proceedings. Meeting the Challenges of the New Millennium (Cat. No.00EX421)
  • 2000
An efficient quantization technique for the rapidly-evolving waveforms in waveform interpolative (WI) coders that takes advantage of spectral and temporal masking and leads to improved reconstructed speech quality, most notably in mixed voiced and unvoiced speech segments.

A waveform extractor for scalable speech coding

  • M. A. Ramírez
  • Computer Science
    2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).
  • 2003
A waveform extractor is proposed which samples waveform cycles of the original prediction residual signal at their natural variable rate so that it can perfectly reconstruct the signal.

Low complexity speech coding at 1.2 to 2.4 kbps based on waveform interpolation

  • Y. Shoham
  • Computer Science
    Int. J. Speech Technol.
  • 1999
New approaches to low-complexity speech coding at coding rates of 1.2 and 2.4 kbps are proposed, which greatly simplify the analysis and synthesis operations to a point where low-cost two-way digital speech communication can be easily implemented on DSP or host platforms.



A speech coder based on decomposition of characteristic waveforms

  • W. KleijnJ. Haagen
  • Computer Science
    1995 International Conference on Acoustics, Speech, and Signal Processing
  • 1995
A 2.4 kb/s coder using waveform interpolation principles to represent the speech signal as an evolving characteristic waveform (CW) and a significant increase in coding efficiency is obtained by coding these two components separately.

Synthesis Methods In Sinusoidal And Waveform-interpolation Coders

  • D. SenW. Kleijn
  • Geology
    Proceedings. IEEE Workshop on Speech Coding for Telecommunications
  • 1995
Waveform-interpolation (WI) coders can use various r e p resentations for the characteristic waveforms. For the commonly used Fourier-series representation, the parameterization of a WI coder is

Harmonic and noise coding of LPC residuals with classified vector quantization

An efficient coding scheme for linear predictive coding (LPC) residuals is proposed based on harmonic and noise representation, which reduces the high complexity of the direct sinusoidal synthesis method with interpolated magnitudes and phases.

Robust vector quantization in spectral coding

  • R. HagenP. Hedelin
  • Computer Science
    1993 IEEE International Conference on Acoustics, Speech, and Signal Processing
  • 1993
Using an LSP (line spectrum pair)-representation for the spectrum, it is demonstrated that short block codes are feasible, thus allowing for compact storage of the code-book and secure channel robustness while allowing for efficient design, storage, and handling of the vector quantizer.

The Dod 4.8 Kbps Standard (Proposed Federal Standard 1016)

A code excited linear predictive (CELP) coder, jointly developed by the DoD and ATT it’s even comparable to 32,000 bps continuously variable slope delta modulation (CVSD) and is robust in acoustic noise, channel errors, and tandem coding conditions.

Encoding Prototype Waveforms Using A Phase Codebook

A prototype encoding technique is proposed which uses spectral magnitude and phase codebooks and shows significant performance improvement with respect to systems using direct time-domain vector quantization of the prototype waveforms.

Adaptive postfiltering for quality enhancement of coded speech

This paper presents a complete description of the original postfiltering algorithm and the underlying ideas that motivated its development, and achieves noticeable noise reduction while introducing only minimal distortion in speech.

B-spline signal processing. I. Theory

The use of continuous B-spline representations for signal processing applications such as interpolation, differentiation, filtering, noise reduction, and data compressions, and the extension of such operators for higher-dimensional signals such as digital images is considered.

Progress towards a new government standard 2400 bps voice coder

The status of the testing and evaluation of several candidate 2400 bps voice compression algorithms from its beginning in early 1993 through the end of 1994 is described.

A Test of Speaker Recognition Using Human Listeners

  • A. Schmidt-Nielsen
  • Physics
    Proceedings. IEEE Workshop on Speech Coding for Telecommunications
  • 1995
This work has developed a speaker recognition test based on SAME-DIFFERENT judgments for pairs of sentences spoken by either the same or different speakers.