Places: A 10 Million Image Database for Scene Recognition

  title={Places: A 10 Million Image Database for Scene Recognition},
  author={Bolei Zhou and {\`A}gata Lapedriza and Aditya Khosla and Aude Oliva and Antonio Torralba},
  journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
The rise of multi-million-item dataset initiatives has enabled data-hungry machine learning algorithms to reach near-human semantic classification performance at tasks such as visual object and scene recognition. [] Key Method Using the state-of-the-art Convolutional Neural Networks (CNNs), we provide scene classification CNNs (Places-CNNs) as baselines, that significantly outperform the previous approaches. Visualization of the CNNs trained on Places shows that object detectors emerge as an intermediate…

Semantic-Aware Scene Recognition

Recognize Various Scenes using Classification methods in Million Images Dataset: A Review

This paper surveys on different techniques of scene recognition from single image to multi-million images, and performs a task for context for the object recognition.

Scene Classification in Indoor Environments for Robots using Context Based Word Embeddings

An approach which combines traditional deep learning techniques with natural language processing methods to generate a word embedding based Scene Classification algorithm which addresses indoor Scene Classification task using a model trained with a reduced pre-processed version of the Places365 dataset.

Scene Retrieval for Contextual Visual Mapping

Visual navigation localizes a query place image against a reference database of place images, also known as a ‘visual map’. Localization accuracy requirements for specific areas of the visual map,

Learning Scene Attribute for Scene Recognition

This paper discusses the discrimination of scene attributes in local regions and utilize scene attributes as the complementary features of object and scene features and aggregate these features and generate more discriminative scene representations, which achieve better performance than the feature aggregation ofobject and scene.

Visual Semantic-Based Representation Learning Using Deep CNNs for Scene Recognition

This work proposes an approach for generating pseudo-concepts in the absence of true concept labels using pre-trained deep CNN-based architectures where activation maps (filter responses) from convolutional layers are considered as initial cues to the pseudo- Concept models.

Multi-Level Ensemble Network for Scene Recognition

Multi-Level Ensemble Network (MLEN), a convolutional neural network, has been proposed, to improve the recognition accuracy of these “small object-supported scenes” and a class-weight loss function for the problem of non-uniform class distribution has been designed.

Deep Learning for Scene Classification: A Survey

A comprehensive survey of recent achievements in scene classification using deep learning covering different aspects of scene classification, including challenges, benchmark datasets, taxonomy, and quantitative performance comparisons of the reviewed methods is provided.

Knowledge Guided Disambiguation for Large-Scale Scene Classification With Multi-Resolution CNNs

A multi-resolution CNN architecture that captures visual content and structure at multiple levels is proposed and two knowledge guided disambiguation techniques to deal with the problem of label ambiguity are designed.



Learning Deep Features for Scene Recognition using Places Database

A new scene-centric database called Places with over 7 million labeled pictures of scenes is introduced with new methods to compare the density and diversity of image datasets and it is shown that Places is as dense as other scene datasets and has more diversity.

SUN database: Large-scale scene recognition from abbey to zoo

This paper proposes the extensive Scene UNderstanding (SUN) database that contains 899 categories and 130,519 images and uses 397 well-sampled categories to evaluate numerous state-of-the-art algorithms for scene recognition and establish new bounds of performance.

80 Million Tiny Images: A Large Data Set for Nonparametric Object and Scene Recognition

For certain classes that are particularly prevalent in the dataset, such as people, this work is able to demonstrate a recognition performance comparable to class-specific Viola-Jones style detectors.

Scene Parsing through ADE20K Dataset

The ADE20K dataset, spanning diverse annotations of scenes, objects, parts of objects, and in some cases even parts of parts, is introduced and it is shown that the trained scene parsing networks can lead to applications such as image content removal and scene synthesis.

Microsoft COCO: Common Objects in Context

We present a new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene

CNN Features Off-the-Shelf: An Astounding Baseline for Recognition

A series of experiments conducted for different recognition tasks using the publicly available code and model of the OverFeat network which was trained to perform object classification on ILSVRC13 suggest that features obtained from deep learning with convolutional nets should be the primary candidate in most visual recognition tasks.

ImageNet Large Scale Visual Recognition Challenge

The creation of this benchmark dataset and the advances in object recognition that have been possible as a result are described, and the state-of-the-art computer vision accuracy with human accuracy is compared.

The Cityscapes Dataset for Semantic Urban Scene Understanding

This work introduces Cityscapes, a benchmark suite and large-scale dataset to train and test approaches for pixel-level and instance-level semantic labeling, and exceeds previous attempts in terms of dataset size, annotation richness, scene variability, and complexity.

Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories

  • S. LazebnikC. SchmidJ. Ponce
  • Computer Science
    2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06)
  • 2006
This paper presents a method for recognizing scene categories based on approximate global geometric correspondence that exceeds the state of the art on the Caltech-101 database and achieves high accuracy on a large database of fifteen natural scene categories.

SUN attribute database: Discovering, annotating, and recognizing scene attributes

This paper performs crowd-sourced human studies to find a taxonomy of 102 discriminative attributes and builds the “SUN attribute database” on top of the diverse SUN categorical database, which has potential for use in high-level scene understanding and fine-grained scene recognition.