• Publications
  • Influence
Real-Time Signal Estimation From Modified Short-Time Fourier Transform Magnitude Spectra
TLDR
An algorithm for estimating signals from short-time magnitude spectra is introduced offering a significant improvement in quality and efficiency over current methods, and is applied to audio time-scale and pitch modification and compared to classical algorithms for these tasks on a variety of signal types.
Audio Spectrogram Representations for Processing with Convolutional Neural Networks
  • L. Wyse
  • Computer Science
    ArXiv
  • 1 May 2017
TLDR
This paper reviews some of these representations and issues that arise, focusing particularly on spectrograms for generating audio using neural networks for style transfer.
An enhanced musical experience for the deaf: design and evaluation of a music display and a haptic chair
TLDR
A prototype system designed to enrich the experience of music for the deaf by enhancing sensory input of information via channels other than in-air audio reception by the ear is described.
A spectral network model of pitch perception.
A model of pitch perception, called the spatial pitch network or SPINET model, is developed and analyzed. The model neurally instantiates ideas from the spectral pitch modeling literature and joins
AN EFFICIENT ALGORITHM FOR REAL-TIME SPECTROGRAM INVERSION
TLDR
A computationally efficient real-time algorithm for constructing audio signals from spectrograms that produces excellent subjective sound quality with minimal fame transition artifacts and is compared with three other methods.
Real-Time Iterative Spectrum Inversion with Look-Ahead
In this paper, we present an algorithm for real-time iterative spectrogram inversion (RTISI) with look-ahead (RTISI-LA). RTISI-LA reconstructs a time-domain signal from a given sequence of short-time
Single Pass Spectrogram Inversion
TLDR
The Single-Pass Spectrogram Inversion (SPSI) algorithm is similar to the synthesis step in phaselocked vocoders, but with phase rates at spectral peaks determined solely from the magnitude spectra using quadratic interpolation.
SOUND TEXTURE MODELING AND TIME-FREQUENCY LPC
TLDR
By using LPC filters in both the time and frequency domain and a statistical representation of the transient sequence, the perceptual quality of the sound textures can be largely preserved, and the model used to manipulate and extend the sounds.
...
...