Audio Surveillance

  title={Audio Surveillance},
  author={Marco Crocco and Marco Cristani and Andrea Trucco and Vittorio Murino},
  journal={ACM Computing Surveys (CSUR)},
  pages={1 - 46}
Despite surveillance systems becoming increasingly ubiquitous in our living environment, automated surveillance, currently based on video sensory modality and machine intelligence, lacks most of the time the robustness and reliability required in several real applications. To tackle this issue, audio sensory devices have been incorporated, both alone or in combination with video, giving birth in the past decade, to a considerable amount of research. In this article, audio-based automated… 

Audio Surveillance System

Real-time audio surveillance system where the audio signals are picked up using highly sensitive microphones and then the signal is amplified to a great extent to get clear insights of what a monitored person is speaking about.

Environmental Audio Scene and Sound Event Recognition for Autonomous Surveillance

Various features of representing audio scenes and sound events are reviewed and appropriate machine learning algorithms for audio surveillance tasks are provided and some of the state-of-the-art approaches are evaluated based on two benchmark datasets for audio Scenes and sound event recognition tasks.

Forensic Audio Analysis and Event Recognition for Smart Surveillance Systems

  • Yusuf OzkanB. Barkana
  • Computer Science
    2019 IEEE International Symposium on Technologies for Homeland Security (HST)
  • 2019
This study proposes forensic audio analysis and event recognition framework for smart surveillance systems and designs K-nearest-neighbor (KNN), support vector machines (SVMs), Gaussian Mixture Model (GMM), and classifier fusion classifiers which achieved promising accuracy rates.

Robust Classification of Abnormal Audio using Background-Foreground Separation

  • CK MeghaV. K. Reddy
  • Computer Science
    2017 14th IEEE India Council International Conference (INDICON)
  • 2017
The proposed methodology focuses on reducing false alarm rates by implementing a Background-Foreground Separation module, which is validated on a large publicly available dataset which is developed for acoustic event classification.

SoReNet: a novel deep network for audio surveillance applications

A method for automatically analyzing the audio stream for surveillance purposes: it is able to detect the presence of abnormal events such as screams, gun shots and broken glasses, using a Convolutional Neural Network with the following two main properties.

Gaussian filter for TDOA based sound source localization in multimedia surveillance

The novel aspect of the approach includes a TDOA based Gaussian filter to improve the accuracy and stability of sound source localization and shows significant improvement over the state of the art TDOA-based algorithm.

Audio-Based Event Detection at Different SNR Settings Using Two-Dimensional Spectrogram Magnitude Representations

In this manuscript, an extensive analysis focused on the comparison of different magnitude representations of the raw audio is presented, and the generalization of the proposed methods on datasets that were both seen and not seen during training are studied and reported.

DENet: a deep architecture for audio surveillance applications

A novel recurrent convolutional neural network architecture, named DENet, based on a new layer that is called denoising-enhancement (DE) layer, which performsDenoising and enhancement of the original signal by applying an attention map on the components of the band-filtered signal.

Toulouse campus surveillance dataset: scenarios, soundtracks, synchronized videos with overlapping and disjoint views

A new dataset for evaluating multi-view based methods, aimed at paving the way for multidisciplinary approaches and applications such as 4D-scene reconstruction, object identification/tracking, audio event detection and multi-source meta-data modeling and querying is introduced.

Detecting Sounds of Interest in Roads with Deep Networks

This paper designs a recognition system based on analysis of audio frames and tested it on the publicly available MIVIA road events data set, demonstrating that the proposed approach can be deployed on embedded devices in a distributed surveillance system.



On-line adaptive background modelling for audio surveillance

This work introduces the concepts of audio BG and FG in an automated surveillance scenario, and proposes a novel audio BG system able to build in real time an adaptive model of the audio scene BG and to promptly detect unexpected FG auditory events.

Events Detection for an Audio-Based Surveillance System

The automatic shot detection system presented is based on a novelty detection approach which offers a solution to detect abnormality (abnormal audio events) in continuous audio recordings of public places and takes advantage of potential similarity between the acoustic signatures of the different types of weapons by building a hierarchical classification system.

Audio analysis for surveillance applications

The proposed hybrid solution is capable of detecting new kinds of suspicious audio events that occur as outliers against a background of usual activity and adaptively learns a Gaussian mixture model to model the background sounds and updates the model incrementally as new audio data arrives.

Probabilistic Novelty Detection for Acoustic Surveillance Under Real-World Conditions

Probabilistic novelty detection can provide an accurate analysis of the audio scene to identify abnormal events and is explored as applied to acoustic surveillance of abnormal situations.

A Multimodal Audio Visible and Infrared Surveillance System (MAVISS)

  • A. MittalP. Kumar
  • Computer Science
    2005 3rd International Conference on Intelligent Sensing and Information Processing
  • 2005
Semantic rules are proposed to identify alarming events by using information from audio and video module of a low cost surveillance system employing multimodal information (visible, infrared and audio signals) for monitoring small area and detecting alarming events.

Multimodal surveillance based personal protection system

A novel, multimodal approach for automatic detection of abduction of a protected individual, employing dedicated personal protection device and a city monitoring system is proposed and overviewed.

Survey on Contemporary Remote Surveillance Systems for Public Safety

  • Tomi D. Räty
  • Computer Science
    IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews)
  • 2010
The historical development and current state of the three different generations of contemporary surveillance systems are reviewed, decomposed into multisensor environments, video and audio surveillance, wireless sensor networks, distributed intelligence and awareness, architecture and middleware, and the utilization of mobile robots.

Real-time audio-visual localization of user using microphone array and vision camera

This paper proposes a novel audio visual user localization system that consists of a microphone array with eight sensors and a video camera, and a robust method for face detection is proposed to double check the user based on an Adaboost classifier.

Intelligent distributed surveillance systems: a review

This survey describes the current state-of-the-art in the development of automated visual surveillance systems so as to provide researchers in the field with a summary of progress achieved to date

Audio Based Event Detection for Multimedia Surveillance

The results show that the proposed top-down event detection approach works significantly better than the single level approach.