LabelMe: A Database and Web-Based Tool for Image Annotation

Abstract

We seek to build a large collection of images with ground truth labels to be used for object detection and recognition research. Such data is useful for supervised learning and quantitative evaluation. To achieve this, we developed a web-based tool that allows easy image annotation and instant sharing of such annotations. Using this annotation tool, we have collected a large dataset that spans many object categories, often containing multiple instances over a wide variety of images. We quantify the contents of the dataset and compare it against existing state-of-the-art datasets used for object recognition and detection. We also show how to extend the dataset to automatically enhance object labels with WordNet, discover object parts, recover a depth ordering of objects in a scene, and increase the number of labels using minimal user supervision and images from the web.

DOI: 10.1007/s11263-007-0090-8

