• Publications
  • Influence
Monaural Sound Source Separation by Nonnegative Matrix Factorization With Temporal Continuity and Sparseness Criteria
  • Tuomas Virtanen
  • Mathematics, Computer Science
  • IEEE Transactions on Audio, Speech, and Language…
  • 1 March 2007
TLDR
An unsupervised learning algorithm for the separation of sound sources in one-channel music signals is presented. Expand
  • 980
  • 84
  • PDF
TUT database for acoustic scene classification and sound event detection
TLDR
We introduce TUT Acoustic Scenes 2016 database for environmental sound research, consisting of binaural recordings from 15 different acoustic environments. Expand
  • 403
  • 62
  • PDF
DCASE 2017 Challenge setup: Tasks, datasets and baseline system
TLDR
DCASE 2017 Challenge consists of four tasks: acoustic scene classification , detection of rare sound events, sound event detection in real-life audio, and large-scale weakly supervised sound event Detection for smart cars. Expand
  • 327
  • 61
  • PDF
Sound Event Localization and Detection of Overlapping Sources Using Convolutional Recurrent Neural Networks
TLDR
In this paper, we propose a convolutional recurrent neural network for joint sound event localization and detection (SELD) of multiple overlapping sound events in three-dimensional space. Expand
  • 138
  • 35
  • PDF
Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection
TLDR
We combine CNN and RNN in a convolutional recurrent neural network (CRNN) and apply it on a polyphonic SED task. Expand
  • 297
  • 33
  • PDF
Metrics for Polyphonic Sound Event Detection
TLDR
This paper presents and discusses various metrics proposed for evaluation of polyphonic sound event detection systems used in realistic situations where there are typically multiple sound sources active simultaneously. Expand
  • 297
  • 29
  • PDF
Exemplar-Based Sparse Representations for Noise Robust Automatic Speech Recognition
TLDR
This paper proposes to use exemplar-based sparse representations to model speech corrupted by additive noise as a linear combination of noise and speech exemplars for noise robust automatic speech recognition. Expand
  • 371
  • 28
  • PDF
A multi-device dataset for urban acoustic scene classification
TLDR
This paper introduces the acoustic scene classification task of DCASE 2018 Challenge and the TUT Urban Acoustic Scenes 2018 dataset provided for the task, and evaluates the performance of a baseline system in the task. Expand
  • 162
  • 28
  • PDF
Recurrent neural networks for polyphonic sound event detection in real life recordings
TLDR
We present an approach to polyphonic sound event detection in real life recordings based on bi-directional long short term memory (BLSTM) recurrent neural networks (RNNs), trained to map acoustic features of a mixture signal consisting of sounds from multiple classes, to binary activity indicators of each event class. Expand
  • 235
  • 27
  • PDF
Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 Challenge
TLDR
Public evaluation campaigns and datasets promote active development in target research areas, allowing direct comparison of algorithms. Expand
  • 156
  • 17
  • PDF
...
1
2
3
4
5
...