#### Filter Results:

- Full text PDF available (122)

#### Publication Year

1996

2017

- This year (14)
- Last 5 years (82)
- Last 10 years (122)

#### Publication Type

#### Co-author

#### Journals and Conferences

#### Data Set Used

#### Key Phrases

#### Method

Learn More

- Tuomas Virtanen
- IEEE Transactions on Audio, Speech, and Language…
- 2007

An unsupervised learning algorithm for the separation of sound sources in one-channel music signals is presented. The algorithm is based on factorizing the magnitude spectrogram of an input signal into a sum of components, each of which has a fixed magnitude spectrum and a time-varying gain. Each sound source, in turn, is modeled as a sum of one or more… (More)

- Annamaria Mesaros, Toni Heittola, Tuomas Virtanen
- 2016 24th European Signal Processing Conference…
- 2016

We introduce TUT Acoustic Scenes 2016 database for environmental sound research, consisting of binaural recordings from 15 different acoustic environments. A subset of this database, called TUT Sound Events 2016, contains annotations for individual sound events, specifically created for sound event detection. TUT Sound Events 2016 consists of residential… (More)

- Jort F. Gemmeke, Tuomas Virtanen, Antti Hurmalainen
- IEEE Transactions on Audio, Speech, and Language…
- 2011

This paper proposes to use exemplar-based sparse representations for noise robust automatic speech recognition. First, we describe how speech can be modeled as a linear combination of a small number of exemplars from a large speech exemplar dictionary. The exemplars are time-frequency patches of real speech, each spanning multiple time frames. We then… (More)

- Tuomas Virtanen, Ali Taylan Cemgil, Simon J. Godsill
- 2008 IEEE International Conference on Acoustics…
- 2008

We describe the underlying probabilistic generative signal model of non-negative matrix factorisation (NMF) and propose a realistic conjugate priors on the matrices to be estimated. A conjugate Gamma chain prior enables modelling the spectral smoothness of natural sounds in general, and other prior knowledge about the spectra of the sounds can be used… (More)

- Tuomas Virtanen, Jort F. Gemmeke, Bhiksha Raj
- IEEE Transactions on Audio, Speech, and Language…
- 2013

This paper proposes a computationally efficient algorithm for estimating the non-negative weights of linear combinations of the atoms of large-scale audio dictionaries, so that the generalized Kullback-Leibler divergence between an audio observation and the model is minimized. This linear model has been found useful in many audio signal processing tasks,… (More)

- Marko Leonard Helén, Tuomas Virtanen
- 2005 13th European Signal Processing Conference
- 2005

This paper presents a procedure for the separation of pitched musical instruments and drums from polyphonic music. The method is based on two-stage processing in which the input signal is first separated into elementary time-frequency components which are then organized into sound sources. Non-negative matrix factorization (NMF) is used to separate the… (More)

- Tuomas Virtanen, Anssi Klapuri
- ICASSP
- 2000

In this paper, an approach for the separation of harmonic sounds is described. The overall system consists of three components. Sinusoidal modeling is first used to analyze the mixed signal and to obtain the frequencies and amplitudes of sinusoidal spectral components. Then a new method is proposed for the calculation of the perceptual distance between… (More)

- Annamaria Mesaros, Tuomas Virtanen
- EURASIP J. Audio, Speech and Music Processing
- 2010

The paper considers the task of recognizing phonemes and words from a singing input by using a phonetic hidden Markov model recognizer. The system is targeted to both monophonic singing and singing in polyphonic music. A vocal separation algorithm is applied to separate the singing from polyphonic music. Due to the lack of annotated singing databases, the… (More)

- Bhiksha Raj, Tuomas Virtanen, Sourish Chaudhuri, Rita Singh
- INTERSPEECH
- 2010

This paper proposes to use non-negative matrix factorization based speech enhancement in robust automatic recognition of mixtures of speech and music. We represent magnitude spectra of noisy speech signals as the non-negative weighted linear combination of speech and noise spectral basis vectors, that are obtained from training corpora of speech and music.… (More)

- Tuomas Virtanen
- SAPA@INTERSPEECH
- 2004

An algorithm for the separation of sound sources is presented. Each source is parametrized as a convolution between a time-frequency magnitude spectrogam and an onset vector. The source model is able to represent several types of sounds, for example repetitive drum sounds and harmonic sounds with modulations. An iterative algorithm is proposed for the… (More)