Places: A 10 Million Image Database for Scene Recognition

@article{Zhou2018PlacesA1,
  title={Places: A 10 Million Image Database for Scene Recognition},
  author={Bolei Zhou and {\`A}. Lapedriza and A. Khosla and A. Oliva and A. Torralba},
  journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
  year={2018},
  volume={40},
  pages={1452-1464}
}
The rise of multi-million-item dataset initiatives has enabled data-hungry machine learning algorithms to reach near-human semantic classification performance at tasks such as visual object and scene recognition. [...] Key Method Using the state-of-the-art Convolutional Neural Networks (CNNs), we provide scene classification CNNs (Places-CNNs) as baselines, that significantly outperform the previous approaches. Visualization of the CNNs trained on Places shows that object detectors emerge as an intermediate…Expand
Semantic-Aware Scene Recognition
TLDR
A novel approach for scene recognition based on an end-to-end multi-modal CNN that combines image and context information by means of an attention module is described, which outperforms every other state-of-the-art method while significantly reducing the number of network parameters. Expand
Recognize Various Scenes using Classification methods in Million Images Dataset: A Review
TLDR
This paper surveys on different techniques of scene recognition from single image to multi-million images, and performs a task for context for the object recognition. Expand
Scene Classification in Indoor Environments for Robots using Context Based Word Embeddings
TLDR
An approach which combines traditional deep learning techniques with natural language processing methods to generate a word embedding based Scene Classification algorithm which addresses indoor Scene Classification task using a model trained with a reduced pre-processed version of the Places365 dataset. Expand
Scene Retrieval for Contextual Visual Mapping
Visual navigation localizes a query place image against a reference database of place images, also known as a ‘visual map’. Localization accuracy requirements for specific areas of the visual map,Expand
Learning Scene Attribute for Scene Recognition
TLDR
This paper discusses the discrimination of scene attributes in local regions and utilize scene attributes as the complementary features of object and scene features and aggregate these features and generate more discriminative scene representations, which achieve better performance than the feature aggregation ofobject and scene. Expand
Multi-Level Ensemble Network for Scene Recognition
TLDR
Multi-Level Ensemble Network (MLEN), a convolutional neural network, has been proposed, to improve the recognition accuracy of these “small object-supported scenes” and a class-weight loss function for the problem of non-uniform class distribution has been designed. Expand
Visual Semantic-Based Representation Learning Using Deep CNNs for Scene Recognition
TLDR
This work proposes an approach for generating pseudo-concepts in the absence of true concept labels using pre-trained deep CNN-based architectures where activation maps (filter responses) from convolutional layers are considered as initial cues to the pseudo- Concept models. Expand
Knowledge Guided Disambiguation for Large-Scale Scene Classification With Multi-Resolution CNNs
TLDR
A multi-resolution CNN architecture that captures visual content and structure at multiple levels is proposed and two knowledge guided disambiguation techniques to deal with the problem of label ambiguity are designed. Expand
RGB-D Scene Recognition based on Object-Scene Relation (Student Abstract)
TLDR
A RGB-D scene recognition model based on object-scene relation (RSBR) and an Appearance Network, which predicts the scene label from the appearance information between objects, is proposed for scene recognition. Expand
Is Whole Object Information Helpful for Scene Recognition?
TLDR
It is found that some object classes may contribute to disturbing scene recognition, which indicates that not only making good use of object information, but also dropping disturbed object information is also important. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 47 REFERENCES
Learning Deep Features for Scene Recognition using Places Database
TLDR
A new scene-centric database called Places with over 7 million labeled pictures of scenes is introduced with new methods to compare the density and diversity of image datasets and it is shown that Places is as dense as other scene datasets and has more diversity. Expand
Object Detectors Emerge in Deep Scene CNNs
TLDR
This work demonstrates that the same network can perform both scene recognition and object localization in a single forward-pass, without ever having been explicitly taught the notion of objects. Expand
SUN database: Large-scale scene recognition from abbey to zoo
TLDR
This paper proposes the extensive Scene UNderstanding (SUN) database that contains 899 categories and 130,519 images and uses 397 well-sampled categories to evaluate numerous state-of-the-art algorithms for scene recognition and establish new bounds of performance. Expand
80 Million Tiny Images: A Large Data Set for Nonparametric Object and Scene Recognition
TLDR
For certain classes that are particularly prevalent in the dataset, such as people, this work is able to demonstrate a recognition performance comparable to class-specific Viola-Jones style detectors. Expand
Scene Parsing through ADE20K Dataset
TLDR
The ADE20K dataset, spanning diverse annotations of scenes, objects, parts of objects, and in some cases even parts of parts, is introduced and it is shown that the trained scene parsing networks can lead to applications such as image content removal and scene synthesis. Expand
Microsoft COCO: Common Objects in Context
We present a new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of sceneExpand
CNN Features Off-the-Shelf: An Astounding Baseline for Recognition
TLDR
A series of experiments conducted for different recognition tasks using the publicly available code and model of the OverFeat network which was trained to perform object classification on ILSVRC13 suggest that features obtained from deep learning with convolutional nets should be the primary candidate in most visual recognition tasks. Expand
ImageNet Large Scale Visual Recognition Challenge
TLDR
The creation of this benchmark dataset and the advances in object recognition that have been possible as a result are described, and the state-of-the-art computer vision accuracy with human accuracy is compared. Expand
Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories
  • S. Lazebnik, C. Schmid, J. Ponce
  • Computer Science
  • 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06)
  • 2006
TLDR
This paper presents a method for recognizing scene categories based on approximate global geometric correspondence that exceeds the state of the art on the Caltech-101 database and achieves high accuracy on a large database of fifteen natural scene categories. Expand
SUN attribute database: Discovering, annotating, and recognizing scene attributes
TLDR
This paper performs crowd-sourced human studies to find a taxonomy of 102 discriminative attributes and builds the “SUN attribute database” on top of the diverse SUN categorical database, which has potential for use in high-level scene understanding and fine-grained scene recognition. Expand
...
1
2
3
4
5
...