Object Bank: A High-Level Image Representation for Scene Classification & Semantic Feature Sparsification

Abstract

Robust low-level image features have proven to be effective representations for a variety of visual recognition tasks such as object recognition and scene classification; but pixels, or even local image patches, carry little semantic meaning. For high-level visual tasks, such low-level image representations are potentially not enough. In this paper, we propose a high-level image representation, called the Object Bank, where an image is represented as a scale-invariant response map of a large number of pre-trained generic object detectors, blind to the testing dataset or visual task. Leveraging the Object Bank representation, superior performance on high-level visual recognition tasks can be achieved with simple off-the-shelf classifiers such as logistic regression and linear SVM. Sparsity algorithms make our representation more efficient and scalable for large scene datasets, and reveal semantically meaningful feature patterns.
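
The abstract does not say how the detector response maps are turned into a fixed-length vector for a classifier. The following is a minimal sketch of one plausible reading, not the authors' implementation. It assumes the responses are already computed and stored as an array of shape (detectors, scales, height, width), and it assumes max pooling over a coarse-to-fine grid; the function name `pool_response_maps` and the `grid_levels` parameter are hypothetical.

```python
import numpy as np


def pool_response_maps(responses, grid_levels=(1, 2, 4)):
    """Flatten detector response maps into one feature vector.

    responses: array of shape (n_detectors, n_scales, height, width),
        assumed to come from pre-trained object detectors (not shown here).
    grid_levels: hypothetical pooling grids; g means a g x g split of the map.
    Returns a 1-D array of length n_detectors * n_scales * sum(g * g).
    """
    responses = np.asarray(responses, dtype=float)
    _, _, height, width = responses.shape
    pooled = []
    for g in grid_levels:
        # Integer cell boundaries that split the map into a g x g grid.
        row_edges = np.linspace(0, height, g + 1).astype(int)
        col_edges = np.linspace(0, width, g + 1).astype(int)
        for i in range(g):
            for j in range(g):
                cell = responses[:, :,
                                 row_edges[i]:row_edges[i + 1],
                                 col_edges[j]:col_edges[j + 1]]
                # Strongest response of each detector at each scale in this cell.
                pooled.append(cell.max(axis=(2, 3)))
    # Shape (n_detectors, n_scales, n_cells), flattened detector-major.
    return np.stack(pooled, axis=-1).reshape(-1)


# Toy usage: 3 detectors, 2 scales, 8 x 8 response maps of random values.
rng = np.random.default_rng(0)
toy_maps = rng.random((3, 2, 8, 8))
toy_features = pool_response_maps(toy_maps)
```

A vector built this way could then be passed to an off-the-shelf linear classifier of the kind the abstract mentions. The maps must be at least as large as the finest grid in each dimension so that no pooling cell is empty.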

Citations per Year (chart omitted; horizontal axis: years 2011 to 2017, vertical axis: 0 to 150 citations)

Semantic Scholar estimates that this publication has 806 citations based on the available data.

Cite this paper

@inproceedings{Li2010ObjectBA, title={Object Bank: A High-Level Image Representation for Scene Classification & Semantic Feature Sparsification}, author={Li-Jia Li and Hao Su and Eric P. Xing and Li Fei-Fei}, booktitle={NIPS}, year={2010} }