• Corpus ID: 52904432

Speech Coding : A Thtorial Review

@inproceedings{Spanias2004SpeechC,
  title={Speech Coding : A Thtorial Review},
  author={Andreas Spanias},
  year={2004}
}
The past decade has witnessed substantial progress towards the application of low-rate speech coders to civilian and military communications as well as computer-related voice applications. Central to this progress has been the development of new speech coders capable of producing high-quality speech at low data rates. Most of these coders incorporate mechanisms to: represent the spectral properties of speech, provide fo r speech waveform matching, and "optimize" the coder's performance for the… 
A REVIEW ON LOW BIT RATE SPEECH CODING
TLDR
A review of low bit rate speech coding is given and some applications which can benefit from the development of algorithms to significantly reduce the speech data rate are given.

References

SHOWING 1-10 OF 96 REFERENCES
Predictive Coding of Speech at Low Bit Rates
  • B. Atal
  • Computer Science
    IEEE Trans. Commun.
  • 1982
TLDR
A new class of speech coders are described which allow one to realize the precise optimum noise spectrum which is crucial to achieving very low bit rates, but also represent the important first step in bridging the gap between waveform coders and vocoders without suffering from their limitations.
Subjective speech-to-noise ratio as a measure of speech quality for digital waveform coders.
TLDR
The subjective speech‐to‐noise‐ratio (SNR), derived from the forced‐choice pair‐comparison test using the psychometric analysis procedure commonly used in the method of constants, is evaluated and well represents overall speech quality in a single dimension.
Coding of speech and wideband audio
TLDR
Advances in coding algorithms and digital signal processing have led to sophisticated technologies for speech communication for a variety of applications, as well as to greater flexibilities in the design of ISDN terminals, which implies stereo teleconferencing or dual-language programming over a 64-kb/s channel.
High-Quality 800-b/s Voice Processing Algorithm.
TLDR
This report presents a new 800-b/s voice-encoding method that produces an intelligibility score of 92 (measured by the Diagnostic Rhyme Test (DRT), which compares favorably with that attained by the 2400- b/s LPC.
Vector quantization and perceptual criteria for low-rate coding of speech
  • M. Copperi, D. Sereno
  • Computer Science
    ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing
  • 1985
TLDR
The main objective is improving the excitation representation in a linear predictive coding scheme and, hence, the subjective quality of synthesized speech signals.
Digital coding of speech waveforms: PCM, DPCM, and DM quantizers
TLDR
It is pointed out that error waveforms in speech quantization cannot be regarded as additive white noise, in general, and that for finer assessments of speech coders, either relative or absolute, one needs to supplement SNR-based observations with corrections for subjective and perceptual factors.
Application of line-spectrum pairs to low-bit-rate speech encoders
  • G. Kang, L. Fransen
  • Computer Science
    ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing
  • 1985
TLDR
The use of Line-Spectrum Pairs (LSPs) makes it possible to employ bit-saving measures more readily than the better known reflection coefficients, and the intelligibility of an LSP-based, pitch-excited vocoder can be made as high as 87 for three male speakers.
Vector quantization: A pattern-matching technique for speech coding
TLDR
Recent results obtained in waveform coding of speech with vector quantization are reviewed, with Vector quantization appearing to be a suitable coding technique which caters to this dual requirement of effective speech coding.
A Low-Delay CELP Coder for the CCITT 16 kb/s Speech Coding Standard
TLDR
The official CCITT laboratory tests revealed that the speech quality of this 16 kb/s LD-CELP coder is either equivalent to or better than that of the CCITT G.721 standard 32-kb/s ADPCM coder for almost all conditions tested.
Backward Adaptive Configurations for Low-Delay Vector Excitation Coding
TLDR
Many of the advances in speech coding in the past decade at rates of 4.8–16 kbit/s have been based on excitation coding by means of analysis-by-synthesis, which is often called Vector Excitation Coding (VXC) or Code Excited Linear Prediction (CELP).
...
...