• Corpus ID: 18072620

Coding for High-Resolution Audio Systems

@article{Stuart2004CodingFH,
  title={Coding for High-Resolution Audio Systems},
  author={John Robert Stuart},
  journal={Journal of the Audio Engineering Society},
  year={2004},
  volume={52},
  pages={117--144}
}
  • J. R. Stuart
  • Published 15 March 2004
  • Computer Science
  • Journal of the Audio Engineering Society
What do we mean by 'high resolution'? The recording and replay chain is reviewed from the viewpoints of digital audio engineering and human psychoacoustics. An attempt is made to define high resolution and to identify the characteristics of a transparent digital audio channel. The theory and practice of selecting high sample rates such as 96 kHz and word lengths of up to 24 bit are examined. The relative importance of sampling rate and word size at various points in the recording, mastering… 

High-Resolution Audio: A History and Perspective

TLDR
The current view is that, beyond dynamic range, the most likely technical sources differentiating the sound of digital formats are the filtering chains that are ubiquitous in traditional digital sampling and reconstruction of analog music sources.

Audibility of a CD-Standard A/D/A Loop Inserted into High-Resolution Audio Playback

Claims both published and anecdotal are regularly made for audibly superior sound quality for two-channel audio encoded with longer word lengths and/or at higher sampling rates than the

Lossless Audio Coding with Bandwidth Extension Layers

  • S. Voran
  • Computer Science
    2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics
  • 2007
TLDR
A layered audio coding paradigm of bandwidth extension, rather than distortion reduction, is proposed, where a core layer can provide lossless coding of a 24 kHz bandwidth signal, then first and second bandwidth extension lossless layers can extend that signal to losslessly coded 48 and then 96 kHz bandwidths.

Probing the temporal resolution and bandwidth of human hearing

Experiments were conducted to assess the human discriminability of temporal convolution. The experiments employed either lowpass filtering or delays due to spatial misalignment. By using special

A Hierarchical Approach for Audio Capture, Archive, and Distribution

TLDR
An audio capture, archiving, and distribution methodology is proposed, based on sampling kernels of finite length, unlike the “ideal” sinc kernel that extends indefinitely. It is shown that with the new kernels, original transient events need not become significantly extended in time when reproduced.

Delivering spatial audio

TLDR
This paper explores the different media available from physical discs, including CD and DVD, through broadcast media and portable players to digital delivery via the Internet, focusing particularly on listener perception of quality and ease of use.

Audibility of temporal smearing and time misalignment of acoustic signals

Misalignment in timing between drivers in a speaker system and temporal smearing of signals in components and cables have long been alleged to cause degradation of fidelity in audio reproduction. It

Cable Pathways Between Audio Components Can Affect Perceived Sound Quality

TLDR
This work utilized a high-performance audio system and an extended-duration listening protocol that more closely resembles audiophile auditioning conditions to prove through direct psychoacoustic testing that two different analog-interconnect pathways can be audibly distinguished.

A Meta-Analysis of High Resolution Audio Perceptual Evaluation

There is considerable debate over the benefits of recording and rendering high resolution audio, i.e., systems and formats that are capable of rendering beyond CD quality audio. We undertook a

A Hierarchical Approach to Archiving and Distribution

TLDR
The aim is an improved time/frequency balance in a high-performance chain whose errors, from the perspective of the human listener, are equivalent to no more than those introduced by sound travelling a short distance through air.

References

Dynamic-Range Issues in the Modern Digital Audio Environment

The peak sound levels of music performances are combined with the audibility of noise in sound reproduction circumstances to yield a dynamic-range criterion for noise-free reproduction of music. This

Pulse-Code Modulation — An Overview

TLDR
This brief survey paper argues that pulse-code-modulation encoding of digital audio signals forms the logical way to extend either the bandwidth or the signal-to-noise ratio of a digital audio system, or both, to encompass even higher resolution.

Antialias Filters and System Transient Response at High Sample Rates

Sample rates higher than 48 kHz allow freedom to tailor the audio response above 20 kHz in order to optimize the transient performance. A recording and reproduction chain may have pre- and

One-Bit Audio: An Overview

An overview of 1-bit audio processing is presented. Several characteristics of the sigma-delta modulator (SDM), currently the most often used device to generate 1-bit code, are discussed, as well as

A High-Rate Buried-Data Channel for Audio CD

TLDR
This proposal uses pseudorandomized data as noise-shaped subtractive dither for the conventional audio to convey a high-rate data channel compatibly within the data stream of an audio CD without significant impairment of existing CD performance.

Binaural time discrimination

The least discriminable change in the position of a sound image was measured for pure tone with and without initial interaural delay, as well as for complex tones. The signals were either perfectly

Why 1-Bit Sigma-Delta Conversion is Unsuitable for High-Quality Applications

TLDR
It is demonstrated that single-stage, 1-bit sigma-delta converters are in principle imperfectible, and using coherent averaging techniques, they are able to display the consequent profusion of nonlinear artefacts which are usually hidden in the noise floor.

Thermal noise limits of microphones

The fundamental limit to sensitivity of acoustic transducers is material to our understanding of nature, has application to microphone technology, and provides a standard with which to assess the

Noise: Methods for Estimating Detectability and Threshold

TLDR
Estimation techniques based on weighting are compared with those given by detection criteria applied to more sophisticated auditory modeling, and various measures of human auditory frequency selectivity are contrasted.

Microsecond temporal resolution in monaural hearing without spectral cues?

TLDR
A monaural masking experiment is described in which pairs of continuous sounds with microsecond time differences were combined and presented to both ears, indicating that the limit of temporal resolution in this case is similar to that in the binaural system.