The Pascal Visual Object Classes (VOC) Challenge

@article{Everingham2009ThePV,
  title={The Pascal Visual Object Classes (VOC) Challenge},
  author={Mark Everingham and Luc Van Gool and Christopher K. I. Williams and John M. Winn and Andrew Zisserman},
  journal={International Journal of Computer Vision},
  year={2009},
  volume={88},
  pages={303-338}
}
The Pascal Visual Object Classes (VOC) challenge is a benchmark in visual object category recognition and detection, providing the vision and machine learning communities with a standard dataset of images and annotation, and standard evaluation procedures. [...] Key Method We review the state-of-the-art in evaluated methods for both classification and detection, analyse whether the methods are statistically different, what they are learning from the images (e.g. the object or its context), and what the methods…Expand
The Pascal Visual Object Classes Challenge: A Retrospective
TLDR
A review of the Pascal Visual Object Classes challenge from 2008-2012 and an appraisal of the aspects of the challenge that worked well, and those that could be improved in future challenges.
ImageNet Large Scale Visual Recognition Challenge
TLDR
The creation of this benchmark dataset and the advances in object recognition that have been possible as a result are described, and the state-of-the-art computer vision accuracy with human accuracy is compared.
Object recognition using CNN
TLDR
This project has implemented a method to carry out object recognition in a weakly supervised manner, using partially annotated data-set, and validated by experimenting with different architectures that mere information of presence/ absence of objects in an image does provide their location information for free.
Object class detection: A survey
TLDR
A comprehensive survey of the recent technical achievements in object class detection research, covering different aspects of the research, including core techniques: appearance modeling, localization strategies, and supervised classification methods.
Multi-class Object Layout with Unsupervised Image Classification and Object Localization
TLDR
This paper provides the analysis of an exemplar-based approach that leverages unsupervised clustering for classification purpose, and sliding window matching for localization that achieves robustness by introducing a novel Conditional Random Field model that facilitates a straightforward accept/reject decision of the localized object classes.
Object Detection with Semi-local Features
TLDR
The results indicate the high potential of semi-local features to assist object detection systems and show that a significant difference exists between different feature extraction methods.
Taxonomy augmented object recognition
TLDR
This work exploits the semantic relationships embedded in object taxonomy to augment the learning of object classifiers and utilizes two types of relationships in the taxonomy, including the overall relationship and the local relationship.
Object Proposal Generation for Unsupervised Object Localization
TLDR
The experimental analysis demonstrates that the performance of these methods depends on the number of candidates and the threshold of recall, meanwhile the methods that generate the candidates with scores perform better than those methods that do not with scores.
A Bottom-Up Approach for Learning Visual Object Detection Models from Unreliable Sources
TLDR
This paper proposes a bottom-up approach for learning object detection models from weakly annotated samples, i.e., only category labels are given per image, and shows that promising object detection results can be achieved by the proposed method.
Region-oriented convolutional networks for object retrieval
TLDR
This thesis builds up on existing work presented at the TRECVID Instance Search Task in 2014, and explores the use of local deep learning features extracted from object proposals, and presents the guidelines to follow to ne-tune a convolutional neural network for tasks such as image classi cation, object detection and semantic segmentation.
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 76 REFERENCES
An Exemplar Model for Learning Object Classes
TLDR
An exemplar model that can learn and generate a region of interest around class instances in a training set, given only a set of images containing the visual class, which enables the detection of multiple instances of the object class in test images.
The 2005 PASCAL Visual Object Classes Challenge
TLDR
This chapter provides details of the datasets, algorithms used by the teams, evaluation criteria, and results achieved in the PASCAL Visual Object Classes Challenge.
Semantic Hierarchies for Visual Object Recognition
TLDR
The semantics of image labels are used to integrate prior knowledge about inter-class relationships into the visual appearance learning and to build and train a semantic hierarchy of discriminative classifiers and how to use it to perform object detection.
The PASCAL visual object classes challenge 2006 (VOC2006) results
This report presents the results of the 2006 PASCAL Visual Object Classes Challenge (VOC2006). Details of the challenge, data, and evaluation are presented. Participants in the challenge submitted
Caltech-256 Object Category Dataset
We introduce a challenging set of 256 object categories containing a total of 30607 images. The original Caltech-101 [1] was collected by choosing a set of object categories, downloading examples
Learning object categories from Google's image search
TLDR
A new model, TSI-pLSA, is developed, which extends pLSA (as applied to visual words) to include spatial information in a translation and scale invariant manner, and can handle the high intra-class variability and large proportion of unrelated images returned by search engines.
Techniques for Image Classification, Object Detection and Object Segmentation
TLDR
The techniques which the method used to participate in the PASCAL NoE VOC Challenge 2007 image analysis performance evaluation campaign produced comparatively good performance, and the method's segmentation accuracy was the best of all submissions.
Sharing Visual Features for Multiclass and Multiview Object Detection
TLDR
A multitask learning procedure, based on boosted decision stumps, that reduces the computational and sample complexity by finding common features that can be shared across the classes (and/or views) and considerably reduce the computational cost of multiclass object detection.
Decomposition, discovery and detection of visual categories using topic models
  • Mario Fritz, B. Schiele
  • Computer Science
    2008 IEEE Conference on Computer Vision and Pattern Recognition
  • 2008
TLDR
The proposed speed-ups make the system scale to large databases and improve the state-of-the-art in unsupervised learning as well as supervised detection on the challenging PASCALpsila06 multi-class detection tasks for several categories.
Fisher Kernels on Visual Vocabularies for Image Categorization
  • F. Perronnin, C. Dance
  • Mathematics, Computer Science
    2007 IEEE Conference on Computer Vision and Pattern Recognition
  • 2007
TLDR
This work shows that Fisher kernels can actually be understood as an extension of the popular bag-of-visterms, and proposes to apply this framework to image categorization where the input signals are images and where the underlying generative model is a visual vocabulary: a Gaussian mixture model which approximates the distribution of low-level features in images.
...
1
2
3
4
5
...