Places: A 10 Million Image Database for Scene Recognition
@article{Zhou2018PlacesA1,
  title={Places: A 10 Million Image Database for Scene Recognition},
  author={Bolei Zhou and {\`A}gata Lapedriza and Aditya Khosla and Aude Oliva and Antonio Torralba},
  journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
  year={2018},
  volume={40},
  pages={1452--1464}
}
The rise of multi-million-item dataset initiatives has enabled data-hungry machine learning algorithms to reach near-human semantic classification performance at tasks such as visual object and scene recognition. […] Using state-of-the-art Convolutional Neural Networks (CNNs), we provide scene classification CNNs (Places-CNNs) as baselines that significantly outperform the previous approaches. Visualization of the CNNs trained on Places shows that object detectors emerge as an intermediate…
2,515 Citations
Recognize Various Scenes using Classification methods in Million Images Dataset: A Review
- Computer Science
- 2019
This paper surveys different scene recognition techniques, from single images to multi-million-image datasets, and performs a task that provides context for object recognition.
Scene Classification in Indoor Environments for Robots using Context Based Word Embeddings
- Computer Science
- ArXiv
- 2019
An approach which combines traditional deep learning techniques with natural language processing methods to generate a word embedding based Scene Classification algorithm which addresses indoor Scene Classification task using a model trained with a reduced pre-processed version of the Places365 dataset.
Scene Retrieval for Contextual Visual Mapping
- Computer Science
- ArXiv
- 2021
Visual navigation localizes a query place image against a reference database of place images, also known as a ‘visual map’. Localization accuracy requirements for specific areas of the visual map,…
Learning Scene Attribute for Scene Recognition
- Computer Science
- IEEE Transactions on Multimedia
- 2020
This paper discusses the discrimination of scene attributes in local regions, uses scene attributes as features complementary to object and scene features, and aggregates these features to generate more discriminative scene representations, which achieve better performance than aggregating object and scene features alone.
Visual Semantic-Based Representation Learning Using Deep CNNs for Scene Recognition
- Computer Science
- ACM Trans. Multim. Comput. Commun. Appl.
- 2021
This work proposes an approach for generating pseudo-concepts in the absence of true concept labels using pre-trained deep CNN-based architectures, where activation maps (filter responses) from convolutional layers are considered as initial cues to the pseudo-concept models.
Multi-Level Ensemble Network for Scene Recognition
- Computer Science
- Multimedia Tools and Applications
- 2019
A Multi-Level Ensemble Network (MLEN), a convolutional neural network, is proposed to improve the recognition accuracy of “small object-supported scenes”, and a class-weight loss function is designed to address the problem of non-uniform class distribution.
Deep Learning for Scene Classification: A Survey
- Computer Science
- ArXiv
- 2021
A comprehensive survey of recent achievements in scene classification using deep learning covering different aspects of scene classification, including challenges, benchmark datasets, taxonomy, and quantitative performance comparisons of the reviewed methods is provided.
Knowledge Guided Disambiguation for Large-Scale Scene Classification With Multi-Resolution CNNs
- Computer Science
- IEEE Transactions on Image Processing
- 2017
A multi-resolution CNN architecture that captures visual content and structure at multiple levels is proposed and two knowledge guided disambiguation techniques to deal with the problem of label ambiguity are designed.
References
SHOWING 1-10 OF 46 REFERENCES
Learning Deep Features for Scene Recognition using Places Database
- Computer Science
- NIPS
- 2014
A new scene-centric database called Places, with over 7 million labeled pictures of scenes, is introduced along with new methods to compare the density and diversity of image datasets; Places is shown to be as dense as other scene datasets and more diverse.
SUN database: Large-scale scene recognition from abbey to zoo
- Computer Science
- 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition
- 2010
This paper proposes the extensive Scene UNderstanding (SUN) database that contains 899 categories and 130,519 images and uses 397 well-sampled categories to evaluate numerous state-of-the-art algorithms for scene recognition and establish new bounds of performance.
80 Million Tiny Images: A Large Data Set for Nonparametric Object and Scene Recognition
- Computer Science
- IEEE Transactions on Pattern Analysis and Machine Intelligence
- 2008
For certain classes that are particularly prevalent in the dataset, such as people, this work is able to demonstrate a recognition performance comparable to class-specific Viola-Jones style detectors.
Scene Parsing through ADE20K Dataset
- Computer Science
- 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
- 2017
The ADE20K dataset, spanning diverse annotations of scenes, objects, parts of objects, and in some cases even parts of parts, is introduced and it is shown that the trained scene parsing networks can lead to applications such as image content removal and scene synthesis.
Microsoft COCO: Common Objects in Context
- Computer Science
- ECCV
- 2014
We present a new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene…
CNN Features Off-the-Shelf: An Astounding Baseline for Recognition
- Computer Science
- 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops
- 2014
A series of experiments on different recognition tasks, conducted using the publicly available code and model of the OverFeat network (trained to perform object classification on ILSVRC13), suggests that features obtained from deep learning with convolutional nets should be the primary candidate in most visual recognition tasks.
ImageNet Large Scale Visual Recognition Challenge
- Computer Science
- International Journal of Computer Vision
- 2015
The creation of this benchmark dataset and the advances in object recognition that it has made possible are described, and state-of-the-art computer vision accuracy is compared with human accuracy.
The Cityscapes Dataset for Semantic Urban Scene Understanding
- Computer Science, Environmental Science
- 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
- 2016
This work introduces Cityscapes, a benchmark suite and large-scale dataset to train and test approaches for pixel-level and instance-level semantic labeling, and exceeds previous attempts in terms of dataset size, annotation richness, scene variability, and complexity.
Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories
- Computer Science
- 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06)
- 2006
This paper presents a method for recognizing scene categories based on approximate global geometric correspondence that exceeds the state of the art on the Caltech-101 database and achieves high accuracy on a large database of fifteen natural scene categories.
SUN attribute database: Discovering, annotating, and recognizing scene attributes
- Computer Science
- 2012 IEEE Conference on Computer Vision and Pattern Recognition
- 2012
This paper performs crowd-sourced human studies to find a taxonomy of 102 discriminative attributes and builds the “SUN attribute database” on top of the diverse SUN categorical database, which has potential for use in high-level scene understanding and fine-grained scene recognition.