Short-term audio-visual atoms for generic video concept classification

Abstract

We investigate the challenging issue of joint audio-visual analysis of generic videos targeting at semantic concept detection. We propose to extract a novel representation, the Short-term Audio-Visual Atom (S-AVA), for improved concept detection. An S-AVA is defined as a short-term region track associated with regional visual features and background audio… (More)
DOI: 10.1145/1631272.1631277

Topics

11 Figures and Tables

Statistics

01020201020112012201320142015201620172018
Citations per Year

86 Citations

Semantic Scholar estimates that this publication has 86 citations based on the available data.

See our FAQ for additional information.

Slides referencing similar topics