Deep Transform: Time-Domain Audio Error Correction via Probabilistic Re-Synthesis

Abstract

During the recording, storage and transmission of time-domain audio signals, errors may be introduced that are difficult to correct in an unsupervised way. Here, we train a convolutional deep neural network to re-synthesize input time-domain speech signals at its output layer. We then use this abstract transformation, which we call a deep transform (DT), to perform probabilistic re-synthesis on further speech from the same speaker that has been degraded. Using the convolutive DT, we demonstrate the recovery of speech audio that has been subjected to extreme degradation. This approach may be useful for correcting errors in communications devices.
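The pipeline described in the abstract (learn to re-synthesize clean frames, then pass degraded frames through the learned mapping) can be illustrated with a much simpler stand-in. The sketch below is not the paper's model: it swaps the convolutional deep network for a linear autoencoder fitted in closed form with an SVD, and it swaps real speech for a synthetic harmonic signal. The frame length, hop size, bottleneck width, noise level, and all function names are hypothetical choices made only for illustration.

```python
import numpy as np

FRAME = 64  # samples per frame (assumed, not from the paper)
HOP = 32    # 50% overlap between frames (assumed)
CODE = 8    # bottleneck width of the linear autoencoder (assumed)


def toy_voice(n, rng):
    """Synthetic stand-in for one speaker: fixed harmonics, random amplitude and phase."""
    t = np.arange(n)
    sig = np.zeros(n)
    for k in (1, 2, 3):
        amp = rng.uniform(0.3, 1.0)
        phase = rng.uniform(0.0, 2.0 * np.pi)
        sig += amp * np.sin(2.0 * np.pi * k * 0.02 * t + phase)
    return sig / 3.0


def frames_of(x):
    """Slice a 1-D signal into overlapping frames of shape (n_frames, FRAME)."""
    n = 1 + (len(x) - FRAME) // HOP
    idx = np.arange(FRAME)[None, :] + HOP * np.arange(n)[:, None]
    return x[idx]


def overlap_add(frames, length):
    """Recombine frames with a periodic Hann window, normalising by the window sum."""
    win = 0.5 - 0.5 * np.cos(2.0 * np.pi * np.arange(FRAME) / FRAME)
    out = np.zeros(length)
    norm = np.zeros(length)
    for i, frame in enumerate(frames):
        start = i * HOP
        out[start:start + FRAME] += win * frame
        norm[start:start + FRAME] += win
    return out / np.maximum(norm, 1e-8)


def fit_resynthesizer(clean_frames):
    """Fit a linear autoencoder on clean frames: keep the top CODE right singular vectors."""
    _, _, vt = np.linalg.svd(clean_frames, full_matrices=False)
    return vt[:CODE]  # shape (CODE, FRAME)


def resynthesize(frames, basis):
    """Encode each frame to CODE coefficients, then decode back to FRAME samples."""
    return (frames @ basis.T) @ basis


rng = np.random.default_rng(0)

# "Training": learn to reproduce clean frames from the same synthetic speaker.
train = np.concatenate([frames_of(toy_voice(4096, rng)) for _ in range(8)])
basis = fit_resynthesizer(train)

# Degrade a held-out signal: heavy additive noise plus random sample dropouts.
clean = toy_voice(4096, rng)
degraded = clean + 0.5 * rng.standard_normal(clean.size)
degraded[rng.random(clean.size) < 0.3] = 0.0

# Re-synthesize the degraded frames through the learned mapping.
restored = overlap_add(resynthesize(frames_of(degraded), basis), clean.size)

# Compare errors away from the edges, where only one frame covers each sample.
inner = slice(HOP, -HOP)
mse_in = np.mean((degraded[inner] - clean[inner]) ** 2)
mse_out = np.mean((restored[inner] - clean[inner]) ** 2)
print(f"MSE before: {mse_in:.4f}, after: {mse_out:.4f}")
```

Because the mapping is only ever fitted to clean frames, anything in a degraded frame that lies outside the learned subspace is discarded on re-synthesis. A deep convolutional network plays the same role in the paper, with a far richer, nonlinear notion of what clean speech from that speaker looks like.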

Cite this paper

@article{Simpson2015DeepTT, title={Deep Transform: Time-Domain Audio Error Correction via Probabilistic Re-Synthesis}, author={Andrew J. R. Simpson}, journal={CoRR}, year={2015}, volume={abs/1503.05849} }