Places: An Image Database for Deep Scene Understanding

@article{Zhou2016PlacesAI,
  title={Places: An Image Database for Deep Scene Understanding},
  author={Bolei Zhou and A. Khosla and {\`A}. Lapedriza and A. Torralba and A. Oliva},
  journal={ArXiv},
  year={2016},
  volume={abs/1610.02055}
}
The rise of multi-million-item dataset initiatives has enabled data-hungry machine learning algorithms to reach near-human semantic classification at tasks such as object and scene recognition. [...] Key Result With its high-coverage and high-diversity of exemplars, the Places Database offers an ecosystem to guide future progress on currently intractable visual recognition problems.Expand
Scene Recognition with Sequential Object Context
TLDR
A deep network architecture which models the sequential object context of scenes to capture object level information and incorporates object-object relationship and object-scene relationship in an end-to-end trainable manner is proposed. Expand
Multi-Scale Multi-Feature Context Modeling for Scene Recognition in the Semantic Manifold
TLDR
This paper proposes discriminative patch representations using neural networks and further proposes a hybrid architecture in which the semantic manifold is built on top of multiscale CNNs, which can be computed significantly faster than the Gaussian mixture models of the original SM. Expand
Deriving high-level scene descriptions from deep scene CNN features
  • Akram Bayat, M. Pomplun
  • Computer Science
  • 2017 Seventh International Conference on Image Processing Theory, Tools and Applications (IPTA)
  • 2017
TLDR
Two computational models are generated in order to estimate two dominant global properties (naturalness and openness) of an input image which can be predicted from activations in the lowest layer of the convolutional neural network which has been trained for a scene recognition task. Expand
Knowledge Guided Disambiguation for Large-Scale Scene Classification With Multi-Resolution CNNs
TLDR
A multi-resolution CNN architecture that captures visual content and structure at multiple levels is proposed and two knowledge guided disambiguation techniques to deal with the problem of label ambiguity are designed. Expand
Scene Recognition by Joint Learning of DNN from Bag of Visual Words and Convolutional DCT Features
TLDR
This paper presents a scene classification method in which local and global features are used and are concatenated with the DCT-Convolutional features of AlexNet, and it clearly outperforms in terms of accuracy. Expand
Exploring confusing scene classes for the places dataset: Insights and solutions
TLDR
This work proposes to use the filter weights at the last stage of a CNN model trained by the Places dataset to explain the source of confusions, and shows that, for a given baseline CNN, the ASC/RF scheme can offer a significant performance gain. Expand
Exploiting Class Hierarchies for Large-Scale Scene Classification Using Hybrid Discriminative Approach
TLDR
Model based on the idea of fine to coarse category mappings is proposed, whose information is combined with the fusion of feature descriptors resulting in a single feature representation that enhances performance by exploiting hierarchical relationship among the scene categories. Expand
Scene Image Representation by Foreground, Background and Hybrid Features
TLDR
This paper adopts three VGG-16 architectures pre-trained on ImageNet, Places, and Hybrid datasets for the corresponding extraction of foreground, background and hybrid information and produces the state-of-the-art classification performance. Expand
Exploring Object-Centric and Scene-Centric CNN Features and Their Complementarity for Human Rights Violations Recognition in Images
TLDR
The human rights archive (HRA) database is introduced, a verified-by-experts repository of 3050 human rights violations photographs, labeled with human rights semantic categories, comprising a list of the types of human rights abuses encountered at present. Expand
Learning Deep NBNN Representations for Robust Place Categorization
TLDR
Experimental results show that the proposed algorithm outperforms previous methods based on pretrained CNN models and that, when employed in challenging robot place recognition tasks, it is robust to occlusions, environmental and sensor changes. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 40 REFERENCES
Learning Deep Features for Scene Recognition using Places Database
TLDR
A new scene-centric database called Places with over 7 million labeled pictures of scenes is introduced with new methods to compare the density and diversity of image datasets and it is shown that Places is as dense as other scene datasets and has more diversity. Expand
Object Detectors Emerge in Deep Scene CNNs
TLDR
This work demonstrates that the same network can perform both scene recognition and object localization in a single forward-pass, without ever having been explicitly taught the notion of objects. Expand
Semantic Understanding of Scenes Through the ADE20K Dataset
TLDR
This work presents a densely annotated dataset ADE20K, which spans diverse annotations of scenes, objects, parts of objects, and in some cases even parts of parts, and shows that the networks trained on this dataset are able to segment a wide variety of scenes and objects. Expand
SUN attribute database: Discovering, annotating, and recognizing scene attributes
TLDR
This paper performs crowd-sourced human studies to find a taxonomy of 102 discriminative attributes and builds the “SUN attribute database” on top of the diverse SUN categorical database, which has potential for use in high-level scene understanding and fine-grained scene recognition. Expand
SUN database: Large-scale scene recognition from abbey to zoo
TLDR
This paper proposes the extensive Scene UNderstanding (SUN) database that contains 899 categories and 130,519 images and uses 397 well-sampled categories to evaluate numerous state-of-the-art algorithms for scene recognition and establish new bounds of performance. Expand
ImageNet Large Scale Visual Recognition Challenge
TLDR
The creation of this benchmark dataset and the advances in object recognition that have been possible as a result are described, and the state-of-the-art computer vision accuracy with human accuracy is compared. Expand
80 Million Tiny Images: A Large Data Set for Nonparametric Object and Scene Recognition
TLDR
For certain classes that are particularly prevalent in the dataset, such as people, this work is able to demonstrate a recognition performance comparable to class-specific Viola-Jones style detectors. Expand
The Cityscapes Dataset for Semantic Urban Scene Understanding
TLDR
This work introduces Cityscapes, a benchmark suite and large-scale dataset to train and test approaches for pixel-level and instance-level semantic labeling, and exceeds previous attempts in terms of dataset size, annotation richness, scene variability, and complexity. Expand
Microsoft COCO: Common Objects in Context
We present a new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of sceneExpand
Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation
TLDR
This paper proposes a simple and scalable detection algorithm that improves mean average precision (mAP) by more than 30% relative to the previous best result on VOC 2012 -- achieving a mAP of 53.3%. Expand
...
1
2
3
4
...