We investigate recursive nearest neighbor search in a sparse domain at the scale of audio signals. Essentially, to approximate the cosine distance between the signals, we make pairwise comparisons between the elements of localized sparse models built from large and redundant multiscale dictionaries of time-frequency atoms. Theoretically, error bounds on…
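As a rough illustration of the pairwise-comparison idea, the sketch below approximates the normalized inner product of two signals from their sparse models alone. It is not the authors' implementation: the function name `approx_cosine`, the dense dictionary matrix `D` with unit-norm columns, and the truncation parameter `n_atoms` are all assumptions made for this example. Each signal is assumed to be modeled as a weighted sum of a few dictionary atoms, so the inner product reduces to a weighted sum of atom-to-atom inner products.

```python
import numpy as np

def approx_cosine(D, idx_x, w_x, idx_y, w_y, norm_x, norm_y, n_atoms=None):
    """Approximate <x, y> / (||x|| ||y||) from sparse models of x and y.

    Hypothetical helper: x ~ sum_i w_x[i] * D[:, idx_x[i]] and likewise for y.
    Atoms are assumed to be ordered by decreasing weight magnitude, so keeping
    only the first n_atoms of each model gives a coarser, cheaper estimate.
    """
    if n_atoms is not None:
        idx_x, w_x = idx_x[:n_atoms], w_x[:n_atoms]
        idx_y, w_y = idx_y[:n_atoms], w_y[:n_atoms]
    # Pairwise inner products between every atom of x's model and every atom of y's.
    gram = D[:, idx_x].T @ D[:, idx_y]
    # Weighted sum of all pairwise terms, normalized by the signal norms.
    return float(w_x @ gram @ w_y) / (norm_x * norm_y)

# Toy usage with an orthonormal dictionary (the identity), an assumption for clarity.
D = np.eye(4)
idx_x, w_x = np.array([2, 0]), np.array([4.0, 3.0])   # x = [3, 0, 4, 0]
idx_y, w_y = np.array([0]), np.array([3.0])           # y = [3, 0, 0, 0]
full = approx_cosine(D, idx_x, w_x, idx_y, w_y, norm_x=5.0, norm_y=3.0)
coarse = approx_cosine(D, idx_x, w_x, idx_y, w_y, norm_x=5.0, norm_y=3.0, n_atoms=1)
```

With a redundant dictionary the atom-to-atom inner products are generally nonzero for many pairs, which is why the order in which pairwise terms are accumulated matters for how quickly the estimate becomes useful.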
Figure 9: Log spectrograms of the query signals with which we search. Top: query of a male speaker saying “cheese.” Middle: query distorted with additive white Gaussian noise (AWGN) at SNR = −10 dB. Bottom: query distorted with an interfering crow sound at SNR = −5 dB.
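The distorted queries described in the caption can be reproduced in spirit by scaling an interfering signal to a target SNR before adding it to the clean query. The sketch below assumes SNR is defined as the ratio of average signal power to average interference power over the whole query; the excerpt does not state the definition used, and the helper name `mix_at_snr` and the synthetic sinusoidal "query" are illustrative stand-ins, not the recordings from the paper.

```python
import numpy as np

def mix_at_snr(signal, interference, snr_db):
    """Add interference to signal, scaled so the mixture has the given SNR in dB.

    Assumes SNR = 10 * log10(P_signal / P_interference), with P the mean power.
    Returns the mixture and the gain applied to the interference.
    """
    p_signal = np.mean(signal ** 2)
    p_interf = np.mean(interference ** 2)
    # Solve p_signal / (gain**2 * p_interf) = 10**(snr_db / 10) for gain.
    gain = np.sqrt(p_signal / (p_interf * 10 ** (snr_db / 10)))
    return signal + gain * interference, gain

# Toy usage: a synthetic tone stands in for the spoken query (assumption).
rng = np.random.default_rng(0)
t = np.arange(16000) / 16000.0
query = np.sin(2 * np.pi * 440.0 * t)
awgn = rng.standard_normal(query.shape)
noisy_query, gain = mix_at_snr(query, awgn, snr_db=-10.0)
achieved_db = 10 * np.log10(np.mean(query ** 2) / np.mean((gain * awgn) ** 2))
```

At −10 dB the interference carries ten times the power of the query, so the mixture is dominated by noise; the same helper applies to a recorded interferer such as the crow sound by passing it in place of `awgn`.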