You Only Look Once: Unified, Real-Time Object Detection

@article{Redmon2016YouOL,
  title={You Only Look Once: Unified, Real-Time Object Detection},
  author={Joseph Redmon and S. Divvala and Ross B. Girshick and Ali Farhadi},
  journal={2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year={2016},
  pages={779-788}
}
We present YOLO, a new approach to object detection. Prior work on object detection repurposes classifiers to perform detection. Instead, we frame object detection as a regression problem to spatially separated bounding boxes and associated class probabilities. A single neural network predicts bounding boxes and class probabilities directly from full images in one evaluation. Since the whole detection pipeline is a single network, it can be optimized end-to-end directly on detection performance… Expand
Comparison Network for One-Shot Conditional Object Detection
TLDR
A novel one-shot conditional object detection framework, referred as Comparison Network (ComparisonNet), has been proposed, which can detect objects of both seen and unseen classes without further training and has the advantages including class-agnostic, training-free for unseen classes, and without catastrophic forgetting. Expand
Zero Shot Detection
TLDR
This work proposes a novel zero-shot method based on training an end-to-end model that fuses semantic attribute prediction with visual features to propose object bounding boxes for seen and unseen classes and observes significant improvements on the average precision of unseen classes. Expand
A Multi-Space Approach to Zero-Shot Object Detection
TLDR
A novel multi-space approach to solve Zero-Shot Object Detection where predictions obtained in two different search spaces are combined and the problem of hubness is discussed and it is shown that the approach alleviates hubness with a performance superior to previously proposed methods. Expand
Single-Shot Object Detection for Face Masks using YOLOv3
Object detection, a subset of computer vision, is an automated method for locating specific objects in an image. Object detec tion technique helps in the recognition, detection, as well asExpand
Dual Refinement Network for Single-Shot Object Detection
TLDR
A dual refinement network (DRN) is proposed to boost the performance of the single-stage detector and a multi-deformable head is designed, in which multiple detection paths with different receptive field sizes devote themselves to detecting objects. Expand
Towards the Success Rate of One: Real-Time Unconstrained Salient Object Detection
TLDR
This work proposes an efficient and effective approach for unconstrained salient object detection in images using deep convolutional neural networks, which performs saliency map prediction without pixel-level annotations, salient object Detection without object proposals, and salient object subitizing simultaneously, all in a single pass within a unified framework. Expand
Learning to Filter Object Detections
TLDR
A filtering network (FNet) is proposed, a method which replaces NMS with a differentiable neural network that allows joint reasoning and re-scoring of the generated set of hypotheses per image, and demonstrates that FNet, a feed-forward network architecture, is able to mimic NMS decisions, despite the sequential nature of NMS. Expand
Fast YOLO: A Fast You Only Look Once System for Real-time Embedded Object Detection in Video
TLDR
The evolutionary deep intelligence framework is leveraged to evolve the YOLOv2 network architecture and produce an optimized architecture that has 2.8X fewer parameters with just a ~2% IOU drop, and a motion-adaptive inference method is introduced into the proposed Fast Y OLO framework to reduce the frequency of deep inference with O-YOLO v2 based on temporal motion characteristics. Expand
Real-time object detection by a multi-feature fully convolutional network
TLDR
A new model free from region proposals for object detection is proposed which treats detection task as a regression problem and can predict bounding boxes and class probabilities simultaneously from a full input image. Expand
DeNet: Scalable Real-Time Object Detection with Directed Sparse Sampling
TLDR
This paper identifies a sparse distribution estimation scheme, Directed Sparse Sampling, and employs it in a single end-to-end CNN based detection model, which is scene adaptive, does not require manually defined reference bounding boxes and produces highly competitive results on MSCOCO, Pascal V OC 2007 and Pascal VOC 2012 with real-time evaluation rates. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 49 REFERENCES
Simultaneous Detection and Segmentation
TLDR
This work builds on recent work that uses convolutional neural networks to classify category-independent region proposals (R-CNN), introducing a novel architecture tailored for SDS, and uses category-specific, top-down figure-ground predictions to refine the bottom-up proposals. Expand
Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation
TLDR
This paper proposes a simple and scalable detection algorithm that improves mean average precision (mAP) by more than 30% relative to the previous best result on VOC 2012 -- achieving a mAP of 53.3%. Expand
Scalable Object Detection Using Deep Neural Networks
TLDR
This work proposes a saliency-inspired neural network model for detection, which predicts a set of class-agnostic bounding boxes along with a single score for each box, corresponding to its likelihood of containing any object of interest. Expand
Region-based Segmentation and Object Detection
TLDR
This work proposes a hierarchical region-based approach to joint object detection and image segmentation that simultaneously reasons about pixels, regions and objects in a coherent probabilistic model and gives a single unified description of the scene. Expand
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
TLDR
This work introduces a Region Proposal Network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals and further merge RPN and Fast R-CNN into a single network by sharing their convolutionAL features. Expand
Robust Real-time Object Detection
TLDR
A visual object detection framework that is capable of processing images extremely rapidly while achieving high detection rates is described, with the introduction of a new image representation called the “Integral Image” which allows the features used by the detector to be computed very quickly. Expand
Learning to Localize Objects with Structured Output Regression
TLDR
This work proposes to treat object localization in a principled way by posing it as a problem of predicting structured data: it model the problem not as binary classification, but as the prediction of the bounding box of objects located in images. Expand
OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks
TLDR
This integrated framework for using Convolutional Networks for classification, localization and detection is the winner of the localization task of the ImageNet Large Scale Visual Recognition Challenge 2013 and obtained very competitive results for the detection and classifications tasks. Expand
Fast, Accurate Detection of 100,000 Object Classes on a Single Machine
Many object detection systems are constrained by the time required to convolve a target image with a bank of filters that code for different aspects of an object's appearance, such as the presence ofExpand
Object Detection with Discriminatively Trained Part Based Models
We describe an object detection system based on mixtures of multiscale deformable part models. Our system is able to represent highly variable object classes and achieves state-of-the-art results inExpand
...
1
2
3
4
5
...