Unsupervised Adversarial Domain Adaptation Based on The Wasserstein Distance For Acoustic Scene Classification

  title={Unsupervised Adversarial Domain Adaptation Based on The Wasserstein Distance For Acoustic Scene Classification},
  author={Konstantinos Drossos and Paul Magron and Tuomas Virtanen},
  journal={2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)},
  • K. Drossos, P. Magron, T. Virtanen
  • Published 24 April 2019
  • Computer Science
  • 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)
A challenging problem in deep learning-based machine listening field is the degradation of the performance when using data from unseen conditions. In this paper we focus on the acoustic scene classification (ASC) task and propose an adversarial deep learning method to allow adapting an acoustic scene classification system to deal with a new acoustic channel resulting from data captured with a different recording device. We build upon the theoretical model of ℋΔℋ-distance and previous… 

Figures from this paper

Adversarial Domain Adaptation with Paired Examples for Acoustic Scene Classification on Different Recording Devices
This paper investigates several adversarial models for domain adaptation (DA) and their effect on the acoustic scene classification task, and finds that the best performing domain adaptation can be obtained using the cycle GAN, which achieves as much as 66% relative improvement in accuracy for the target domain device, while only 6 % relative decrease on the source domain.
Feature Projection-Based Unsupervised Domain Adaptation for Acoustic Scene Classification
This work proposes an unsupervised domain adaptation method for ASC based on the projection of spectro-temporal features extracted from both the source and target domain onto the principal subspace spanned by the eigenvectors of the sample covariance matrix of source-domain training data.
Towards Audio Domain Adaptation for Acoustic Scene Classification using Disentanglement Learning
A novel domain adaptation strategy based on disentanglement learning is proposed to disentangle task-specific and domain-specific characteristics in the analyzed audio recordings and a novel combination of categorical cross-entropy and variance-based losses is suggested.
Unsupervised Domain Adaptation for Acoustic Scene Classification Using Band-Wise Statistics Matching
An unsupervised domain adaptation method that consists of aligning the first- and second-order sample statistics of each frequency band of target-domain acoustic scenes to the ones of the source-domain training dataset is proposed to adapt audio samples from unseen devices before they are fed to a pre-trained classifier, thus avoiding any further learning phase.
Unsupervised Multi-Target Domain Adaptation for Acoustic Scene Classification
A novel unsupervised multi-target domain adaption (MTDA) method for ASC, which can adapt to multiple target domains simultaneously and make use of the underlying relation among multiple domains.
Prototypical Networks for Domain Adaptation in Acoustic Scene Classification
This work explores a metric learning approach called prototypical networks using the TUT Urban Acoustic Scenes dataset, which consists of 10 different acoustic scenes recorded across 10 cities, and concludes that metric learning is a promising approach towards addressing the domain adaptation problem in ASC.
A Review of Deep Learning Based Methods for Acoustic Scene Classification
This article summarizes and groups existing approaches for data preparation, i.e., feature representations, feature pre-processing, and data augmentation, and for data modeling, i.
VOICe: A Sound Event Detection Dataset For Generalizable Domain Adaptation
VOICe is presented, the first dataset for the development and evaluation of domain adaptation methods for sound event detection, which consists of mixtures with three different sound events over-imposed over three different categories of acoustic scenes: vehicle, outdoors, and indoors.
Relational Teacher Student Learning with Neural Label Embedding for Device Adaptation in Acoustic Scene Classification
A domain adaptation framework to address the device mismatch issue in acoustic scene classification leveraging upon neural label embedding (NLE) and relational teacher student learning (RTSL) and results confirm the effectiveness of the proposed approach for mismatch situations.
A Weighted Partial Domain Adaptation for Acoustic Scene Classification and Its Application in Fiber Optic Security System
A weighted partial domain adaptation method is proposed to solve the Acoustic Scene Classification (ASC) problem and is applied to an optical fiber perimeter security system to complete early warning by identifying intrusion signals.


Unsupervised adversarial domain adaptation for acoustic scene classification
The first method of unsupervised adversarial domain adaptation for acoustic scene classification is presented, which employs a model pre-trained on data from one set of conditions and by using data from other set of Conditions, which adapt the model in order that its output cannot be used for classifying the set of condition that input data belong to.
Adversarial Discriminative Domain Adaptation
It is shown that ADDA is more effective yet considerably simpler than competing domain-adversarial methods, and the promise of the approach is demonstrated by exceeding state-of-the-art unsupervised adaptation results on standard domain adaptation tasks as well as a difficult cross-modality object classification task.
Improving Adversarial Discriminative Domain Adaptation
The final design employs maximum mean discrepancy and reconstruction-based loss functions for adversarial training, and is both simple and efficient, as it competes or outperforms the state-of-the-art in unsupervised domain adaptation, whilst offering lower complexity than other recent adversarial methods such as DIFA and CoGAN.
Convolutional Neural Networks with Binaural Representations and Background Subtraction for Acoustic Scene Classification
The experimental results show that the proposed network structures and the preprocessing methods effectively learn acoustic characteristics from the audio recordings, and their ensemble model significantly reduces the error rate further.
Multi-Adversarial Domain Adaptation
A multi-adversarial domain adaptation (MADA) approach, which captures multimode structures to enable fine-grained alignment of different data distributions based on multiple domain discriminators and outperforms state of the art methods on standard domain adaptation datasets.
Domain-Adversarial Training of Neural Networks
A new representation learning approach for domain adaptation, in which data at training and test time come from similar but different distributions, which can be achieved in almost any feed-forward model by augmenting it with few standard layers and a new gradient reversal layer.
CyCADA: Cycle-Consistent Adversarial Domain Adaptation
A novel discriminatively-trained Cycle-Consistent Adversarial Domain Adaptation model that adapts representations at both the pixel-level and feature-level, enforces cycle-consistency while leveraging a task loss, and does not require aligned pairs is proposed.
Adversarial Multiple Source Domain Adaptation
This paper proposes multisource domain adversarial networks (MDAN) that approach domain adaptation by optimizing task-adaptive generalization bounds and conducts extensive experiments showing superior adaptation performance on both classification and regression problems: sentiment analysis, digit classification, and vehicle counting.
Unsupervised Domain Adaptation by Backpropagation
The method performs very well in a series of image classification experiments, achieving adaptation effect in the presence of big domain shifts and outperforming previous state-of-the-art on Office datasets.
Generative Adversarial Nets
We propose a new framework for estimating generative models via an adversarial process, in which we simultaneously train two models: a generative model G that captures the data distribution, and a