ImageNet Large Scale Visual Recognition Challenge
@article{Russakovsky2015ImageNetLS, title={ImageNet Large Scale Visual Recognition Challenge}, author={Olga Russakovsky and Jia Deng and Hao Su and Jonathan Krause and Sanjeev Satheesh and Sean Ma and Zhiheng Huang and Andrej Karpathy and Aditya Khosla and Michael S. Bernstein and Alexander C. Berg and Li Fei-Fei}, journal={International Journal of Computer Vision}, year={2015}, volume={115}, pages={211-252} }
The ImageNet Large Scale Visual Recognition Challenge is a benchmark in object category classification and detection on hundreds of object categories and millions of images. [] Key Result We conclude with lessons learned in the 5 years of the challenge, and propose future directions and improvements.
25,578 Citations
Techniques for Image Classification on Tiny-ImageNet
- Computer Science
- 2017
This work worked with a smaller dataset, TinyImageNet, and attempted to train an image classifier using this data, and applied a wide variety of techniques to achieve a high classification accuracy on Tiny-ImageNet.
LSDA: Large Scale Detection through Adaptation
- Computer ScienceNIPS
- 2014
This paper proposes Large Scale Detection through Adaptation (LSDA), an algorithm which learns the difference between the two tasks and transfers this knowledge to classifiers for categories without bounding box annotated data, turning them into detectors.
From Large-Scale Object Classifiers to Large-Scale Object Detectors: An Adaptation Approach
- Computer Science
- 2014
This paper proposes a Deep Detection Adaptation (DDA) algorithm which learns the difference between the two tasks and transfers this knowledge to classifiers for categories without bounding box annotated data, turning them into detectors.
Immediate, Scalable Object Category Detection
- Computer Science2014 IEEE Conference on Computer Vision and Pattern Recognition
- 2014
A new image representation based on mid-level discriminative patches is designed to be suited to immediate object category detection and inverted file indexing and a fast method for spatial reranking images on their detections is demonstrated.
An Evolution of CNN Object Classifiers on Low-Resolution Images
- Computer Science2020 IEEE 17th International Conference on Smart Communities: Improving Quality of Life Using ICT, IoT and AI (HONET)
- 2020
This work investigates an optimal architecture that accurately classifies low-quality images using DCNNs architectures and evaluates the proposed architecture by implementing popular CNN architectures, finding that the MobileNet architecture delivers better than most of the available CNN architectures for low-resolution webcam image datasets.
Deep Learning for Generic Object Detection: A Survey
- Computer ScienceInternational Journal of Computer Vision
- 2019
A comprehensive survey of the recent achievements in this field brought about by deep learning techniques, covering many aspects of generic object detection: detection frameworks, object feature representation, object proposal generation, context modeling, training strategies, and evaluation metrics.
Very Deep Convolutional Networks for Large-Scale Image Recognition
- Computer ScienceICLR
- 2015
This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.
Person Classification with Convolutional Neural Networks
- Computer Science
- 2016
Results emphasize the complexity of convolutional neural network architecture choice and indicate increased robustness in training with subclass labeling with regards to initialization and solver algorithms.
Large Scale Visual Recognition through Adaptation using Joint Representation and Multiple Instance Learning
- Computer ScienceJ. Mach. Learn. Res.
- 2016
This work provides a novel formulation of a joint multiple instance learning method that includes examples from object-centric data with image-level labels when available, and also performs domain transfer learning to improve the underlying detector representation.
Boosted Object Detection Based on Local Features
- Computer Science
- 2016
The Generalization and Efficiency Balance framework, and the hierarchical weak classifier are designed for this target, and as a result, the resulting boosted classifiers not only achieve high accuracy, but also have good generalization and efficiency.
References
SHOWING 1-10 OF 139 REFERENCES
The Pascal Visual Object Classes (VOC) Challenge
- Computer ScienceInternational Journal of Computer Vision
- 2009
The state-of-the-art in evaluated methods for both classification and detection are reviewed, whether the methods are statistically different, what they are learning from the images, and what the methods find easy or confuse.
Very Deep Convolutional Networks for Large-Scale Image Recognition
- Computer ScienceICLR
- 2015
This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.
Learning Deep Features for Scene Recognition using Places Database
- Computer ScienceNIPS
- 2014
A new scene-centric database called Places with over 7 million labeled pictures of scenes is introduced with new methods to compare the density and diversity of image datasets and it is shown that Places is as dense as other scene datasets and has more diversity.
Caltech-256 Object Category Dataset
- Computer Science
- 2007
A challenging set of 256 object categories containing a total of 30607 images is introduced and the clutter category is used to train an interest detector which rejects uninformative background regions.
SUN database: Large-scale scene recognition from abbey to zoo
- Computer Science2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition
- 2010
This paper proposes the extensive Scene UNderstanding (SUN) database that contains 899 categories and 130,519 images and uses 397 well-sampled categories to evaluate numerous state-of-the-art algorithms for scene recognition and establish new bounds of performance.
Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation
- Computer Science2014 IEEE Conference on Computer Vision and Pattern Recognition
- 2014
This paper proposes a simple and scalable detection algorithm that improves mean average precision (mAP) by more than 30% relative to the previous best result on VOC 2012 -- achieving a mAP of 53.3%.
The Pascal Visual Object Classes Challenge: A Retrospective
- Computer ScienceInternational Journal of Computer Vision
- 2014
A review of the Pascal Visual Object Classes challenge from 2008-2012 and an appraisal of the aspects of the challenge that worked well, and those that could be improved in future challenges.
Large-scale image classification: Fast feature extraction and SVM training
- Computer ScienceCVPR 2011
- 2011
A parallel averaging stochastic gradient descent (ASGD) algorithm for training one-against-all 1000-class SVM classifiers and a Hadoop scheme that performs feature extraction in parallel using hundreds of mappers, which achieves state-of-the-art performance on the ImageNet 1000- class classification.
Microsoft COCO: Common Objects in Context
- Computer ScienceECCV
- 2014
We present a new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene…
80 Million Tiny Images: A Large Data Set for Nonparametric Object and Scene Recognition
- Computer ScienceIEEE Transactions on Pattern Analysis and Machine Intelligence
- 2008
For certain classes that are particularly prevalent in the dataset, such as people, this work is able to demonstrate a recognition performance comparable to class-specific Viola-Jones style detectors.