An international comparison of long‐term average speech spectra

@article{Byrne1994AnIC,
  title={An international comparison of long‐term average speech spectra},
  author={Denis Byrne and Harvey Dillon and Khanh Vien Tran and Stig Arlinger and Keith Wilbraham and Robyn M. Cox and Bj{\"o}rn Hagerman and Raymond H{\'e}tu and Joseph Kei and Crystal P Y Lui and J{\"u}rgen Kiessling and M. Nasser Kotby and Nasser H. Abdel Nasser and Wafaa Abdel Hai El Kholy and Yasuko Nakanishi and Herbert J. Oyer and Richard Powell and Dafydd Stephens and Rhys Meredith and Tony Sirimanna and George Tavartkiladze and Gregory I. Frolenkov and Soren Westerman and Carl Ludvigsen},
  journal={Journal of the Acoustical Society of America},
  year={1994},
  volume={96},
  pages={2108--2120}
}
The long‐term average speech spectrum (LTASS) and some dynamic characteristics of speech were determined for 12 languages: English (several dialects), Swedish, Danish, German, French (Canadian), Japanese, Cantonese, Mandarin, Russian, Welsh, Singhalese, and Vietnamese. The LTASS only was also measured for Arabic. Speech samples (18) were recorded, using standardized equipment and procedures, in 15 localities for (usually) ten male and ten female talkers. All analyses were conducted at the… 

Cross-language comparison of long-term average speech spectrum and dynamic range for three Indian languages and British English

Purpose: The Long-Term Average Speech Spectrum (LTASS) and Dynamic Range (DR) of speech strongly influence estimates of Speech Intelligibility Index (SII), gain and compression required for hearing

Long-Term Average Speech Spectra and Dynamic Ranges of 17 Indian Languages.

The long-term average speech spectra (LTASS) and dynamic ranges (DR) of 17 Indian languages are determined and a common LTASS and DR is proposed for Indian languages to help improve the performance of hearing aids in the Indian context.

Cross-Language Identification of Long-Term Average Speech Spectra in Korean and English: Toward a Better Understanding of the Quantitative Difference Between Two Languages

To adjust the formula for fitting hearing aids for Koreans, the results based on the LTASS analysis suggest that one needs to raise the gain in high-frequency regions.

How Does Speaking Clearly Influence Acoustic Measures? A Speech Clarity Study Using Long-term Average Speech Spectra in Korean Language

This study showed that the drop-off of the LTASS in the low-frequency region might explain why the speech of women and announcers was rated as clearer than that of men and ordinary persons, respectively.

Development of the Polish Speech Test Signal and its Comparison with the International Speech Test Signal

The aim of this study was to create a single-language counterpart of the International Speech Test Signal (ISTS) and to compare both with respect to their acoustical characteristics, finding some differences between ISTS and PSTS.

Long-term average spectra of adult Iranian speakers' voice.

Dynamic range for speech materials in Korean, English, and Mandarin: a cross-language comparison.

The observed differences in DR across languages suggest that the best-fit DR for Korean and Mandarin may be different than the best fit for English.

Speaker Discrimination Using Long-Term Spectrum of Speech

The voiced speech seems to be generally more effective for speaker recognition using the long-term speech spectrum, and the best recognition rates were achieved in optimal paired subbands, which can complement the traditional voice features.

A Swedish version of the Hearing In Noise Test (HINT) for measurement of speech recognition

A Swedish Hearing In Noise Test (HINT), consisting of everyday sentences to be used in an adaptive procedure to estimate the speech recognition thresholds in noise and quiet, has been developed and resulted in a well-defined and internationally comparable set of sentences.

Sentence recognition in native- and foreign-language multi-talker background noise.

These findings demonstrate informational masking on sentence-in-noise recognition in the form of "linguistic interference" at the lexical, sublexical, and/or prosodic levels of linguistic structure; whether it is modulated by the phonetic similarity between the target and noise languages remains to be determined.

References

Distribution of short-term rms levels in conversational speech.

The effect of measurement interval was least for the highest amplitude speech levels and increased as speech levels decreased, whereas for short-term amplitudes below the median level, measurement interval had the greatest effect on the lower frequency bands.

On the Spectrum of Spoken English

The spectrum of spoken English was measured in 1/3‐octave bands for men, women, and children. The speech produced by reading aloud newspaper text was recorded in an anechoic chamber for each member of

Ear level recordings of the long-term average spectrum of speech.

The findings suggest that the algorithms currently used to prescribe hearing aid gain may underestimate the sensation level of a hearing-impaired individual's own amplified speech productions at frequencies below 1000 Hz and overestimate the sensation level of a talker's own speech above 2500 Hz.

The speech spectrum--some aspects of its significance for hearing aid selection and evaluation.

D. Byrne, British Journal of Audiology, 1977
Considering that speech spectra differ greatly from one individual to another, it would seem desirable that speech with a known, preferably average, spectrum be used in hearing aid evaluation procedures which involve setting the volume control of a hearing aid to deliver speech at a comfortable listening level.

Composite speech spectrum for hearing aid gain prescriptions.

It was concluded that a single spectrum could validly be used to represent both male and female speech in the frequency region important for hearing aid gain prescriptions: 250 Hz through 6300 Hz.

Some Variables in Audio Spectrometry

Measurements of the long‐time average spectrum of speech are reported with the purpose of pointing out the roles of several variables. Three different sampling times of three different types of

Statistical Measurements on Conversational Speech

Using apparatus designed to collect a large number of data in a short time, the following measurements have been made: peak and r.m.s. pressures in one‐eighth‐second intervals, and in various bands

The National Acoustic Laboratories' (NAL) New Procedure for Selecting the Gain and Frequency Response of a Hearing Aid

It is concluded that the new formula for selecting the gain and frequency response of a hearing aid should prescribe a near optimal frequency response with few exceptions.

The perception of speech and its relation to telephony.

Relationships among several of these measures and the articulation index are established and functions are developed which permit the calculation of articulation index and hence of articulation for communication systems which include a wide variety of response versus frequency characteristics and of noise conditions, as well as several special types of distortion.

Exploration of Pressure Field Around the Human Head During Speech

The pressure spectrum of average speech has been measured at 80 different positions about the head of a speaker, thus showing the directional properties of the mouth and head as a sound radiator. The