Clustering and synchronizing multi-camera video via landmark cross-correlation


We propose a method to both identify and synchronize multi-camera video recordings within a large collection of video and/or audio files. Landmark-based audio fingerprinting is used to match multiple recordings of the same event together and time-synchronize each file within the groups. Compared to prior work, we offer improvements towards event identification and a new synchronization refinement method that resolves inconsistent estimates and allows non-overlapping content to be synchronized within larger groups of recordings. Furthermore, the audio fingerprinting-based synchronization is shown to be equivalent to an efficient and scalable time-difference-of-arrival method using cross-correlation performed on a non-linearly transformed signal.

DOI: 10.1109/ICASSP.2012.6288396

Extracted Key Phrases

6 Figures and Tables

Cite this paper

@article{Bryan2012ClusteringAS, title={Clustering and synchronizing multi-camera video via landmark cross-correlation}, author={Nicholas J. Bryan and Paris Smaragdis and Gautham J. Mysore}, journal={2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)}, year={2012}, pages={2389-2392} }