• Corpus ID: 241035538

A Strongly-Labelled Polyphonic Dataset of Urban Sounds with Spatiotemporal Context

  • Kenneth Ooi, Karn N. Watcharasupat, Santi Peksi, Furi Andi Karnapi, Zhen-Ting Ong, Danny Chua, Hui-Wen Leow, Li-Long Kwok, Xin-Lei Ng, Zhen-Ann Loh, Woonseng Gan
  • Published 3 November 2021
  • Computer Science
  • 2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)
This paper introduces SINGA: PURA, a strongly labelled polyphonic urban sound dataset with spatiotemporal context. The data were collected via several recording units deployed across Singapore as part of a wireless acoustic sensor network. These recordings were made for a project to identify and mitigate noise sources in Singapore, but they are also more widely applicable to sound event detection, classification, and localization. This paper introduces an accompanying hierarchical label…


ARAUS: A Large-Scale Dataset and Baseline Models of Affective Responses to Augmented Urban Soundscapes

The ARAUS (Affective Responses to Augmented Urban Soundscapes) dataset, which comprises a cross-validation set and independent test set totaling 25,440 unique subjective perceptual responses to augmented soundscapes presented as audio-visual stimuli, is made publicly available.



SONYC-UST-V2: An Urban Sound Tagging Dataset with Spatiotemporal Context

The data collection procedure is described, evaluation metrics for multilabel classification of urban sound tags are proposed, and the results of a simple baseline model that exploits spatiotemporal information are reported.

A Dataset and Taxonomy for Urban Sound Research

A taxonomy of urban sounds and a new dataset, UrbanSound, containing 27 hours of audio with 18.5 hours of annotated sound event occurrences across 10 sound classes are presented.

SONYC: a system for monitoring, analyzing, and mitigating urban noise pollution

Noise pollution is not merely an annoyance but an important problem with broad societal effects that apply to a significant portion of the population, and effective noise mitigation is in the public interest, with the promise of health, economic, and quality-of-life benefits.

Stadtlärm - A Distributed System for Noise Level Measurement and Noise Source Identification in a Smart City Environment

Various types of acoustic scenes such as railway, road, airplane and industrial noise, construction sites, open air concerts, sport events and natural noise sources contribute to the overall rising

Extracting Urban Sound Information for Residential Areas in Smart Cities Using an End-to-End IoT System

An end-to-end Internet-of-Things (IoT) system that extracts real-time urban sound metadata using edge devices, providing information on the sound type, location and duration, rate of occurrence, loudness, and azimuth of a dominant noise in nine residential areas is presented.

USM-SED - A Dataset for Polyphonic Sound Event Detection in Urban Sound Monitoring Scenarios

This paper introduces a novel dataset for polyphonic sound event detection in urban sound monitoring use-cases. Based on isolated sounds taken from the FSD50K dataset, 20,000 polyphonic soundscapes


A feature vector is constructed from the spatiotemporal metadata and used in parallel with log-mel spectrogram features to facilitate sound tagging, and the presence of multiple annotations per recording is addressed with a pseudo-labelling technique.

DESED-FL and URBAN-FL: Federated Learning Datasets for Sound Event Detection

The results indicate that federated learning (FL) is a promising approach for sound event detection (SED), but that it faces challenges with the divergent data distributions inherent to distributed client edge devices.

A Low-cost Wireless Acoustic Sensor Network for the Classification of Urban Sounds

A wireless acoustic sensor network (WASN) that recognizes a set of sound events or classes from urban environments is presented; it is the first WASN running a CNN classifier over low-cost devices, and it achieves similar accuracy to other WASNs that perform the classification through cloud or edge computing.