Sparselet Models for Efficient Multiclass Object Detection


We develop an intermediate representation for deformable part models and show that this representation has favorable performance characteristics for multi-class problems when the number of classes is high. Our model uses sparse coding of part filters to represent each filter as a sparse linear combination of shared dictionary elements. This leads to a universal set of parts that are shared among all object classes. Reconstruction of the original part filter responses via sparse matrix-vector product reduces computation relative to conventional part filter convolutions. Our model is well suited to a parallel implementation, and we report a new GPU DPM implementation that takes advantage of sparse coding of part filters. The speed-up offered by our intermediate representation and parallel computation enable real-time DPM detection of 20 different object classes on a laptop computer.

DOI: 10.1007/978-3-642-33709-3_57

Extracted Key Phrases

6 Figures and Tables

Citations per Year

83 Citations

Semantic Scholar estimates that this publication has 83 citations based on the available data.

See our FAQ for additional information.

Cite this paper

@inproceedings{Song2012SparseletMF, title={Sparselet Models for Efficient Multiclass Object Detection}, author={Hyun Oh Song and Stefan Zickler and Tim Althoff and Ross B. Girshick and Mario Fritz and Christopher Geyer and Pedro F. Felzenszwalb and Trevor Darrell}, booktitle={ECCV}, year={2012} }