Visual tracking and recognition using appearance-adaptive models in particle filters

Abstract

We present an approach that incorporates appearance-adaptive models in a particle filter to realize robust visual tracking and recognition algorithms. Tracking needs modeling interframe motion and appearance changes, whereas recognition needs modeling appearance changes between frames and gallery images. In conventional tracking algorithms, the appearance model is either fixed or rapidly changing, and the motion model is simply a random walk with fixed noise variance. Also, the number of particles is typically fixed. All these factors make the visual tracker unstable. To stabilize the tracker, we propose the following modifications: an observation model arising from an adaptive appearance model, an adaptive velocity motion model with adaptive noise variance, and an adaptive number of particles. The adaptive-velocity model is derived using a first-order linear predictor based on the appearance difference between the incoming observation and the previous particle configuration. Occlusion analysis is implemented using robust statistics. Experimental results on tracking visual objects in long outdoor and indoor video sequences demonstrate the effectiveness and robustness of our tracking algorithm. We then perform simultaneous tracking and recognition by embedding them in a particle filter. For recognition purposes, we model the appearance changes between frames and gallery images by constructing the intra- and extrapersonal spaces. Accurate recognition is achieved when confronted by pose and view variations.

DOI: 10.1109/TIP.2004.836152

Extracted Key Phrases

13 Figures and Tables

020406080'03'05'07'09'11'13'15'17
Citations per Year

590 Citations

Semantic Scholar estimates that this publication has 590 citations based on the available data.

See our FAQ for additional information.

Cite this paper

@article{Zhou2004VisualTA, title={Visual tracking and recognition using appearance-adaptive models in particle filters}, author={Shaohua Kevin Zhou and Rama Chellappa and Baback Moghaddam}, journal={IEEE Transactions on Image Processing}, year={2004}, volume={13}, pages={1491-1506} }