Convolutive Speech Bases and Their Application to Supervised Speech Separation

  • Paris Smaragdis
  • Published 2007 in
    IEEE Transactions on Audio, Speech, and Language…


In this paper, we present a convolutive basis decomposition method and its application on simultaneous speakers separation from monophonic recordings. The model we propose is a convolutive version of the nonnegative matrix factorization algorithm. Due to the nonnegativity constraint this type of coding is very well suited for intuitively and efficiently representing magnitude spectra. We present results that reveal the nature of these basis functions and we introduce their utility in separating monophonic mixtures of known speakers

DOI: 10.1109/TASL.2006.876726

Extracted Key Phrases

8 Figures and Tables

Citations per Year

298 Citations

Semantic Scholar estimates that this publication has 298 citations based on the available data.

See our FAQ for additional information.

Cite this paper

@article{Smaragdis2007ConvolutiveSB, title={Convolutive Speech Bases and Their Application to Supervised Speech Separation}, author={Paris Smaragdis}, journal={IEEE Transactions on Audio, Speech, and Language Processing}, year={2007}, volume={15}, pages={1-12} }