DOCK: Detecting Objects by Transferring Common-Sense Knowledge

@inproceedings{Singh2018DOCKDO,
  title={DOCK: Detecting Objects by Transferring Common-Sense Knowledge},
  author={Krishna Kumar Singh and Santosh Kumar Divvala and Ali Farhadi and Yong Jae Lee},
  booktitle={ECCV},
  year={2018}
}
We present a scalable approach for Detecting Objects by transferring Common-sense Knowledge (DOCK) from source to target categories. In our setting, the training data for the source categories have bounding box annotations, while those for the target categories only have image-level annotations. Current state-of-the-art approaches focus on image-level visual or semantic similarity to adapt a detector trained on the source categories to the new target categories. In contrast, our key idea is to… Expand
Learning Open-World Object Proposals without Learning to Classify
Object proposals have become an integral preprocessing steps of many vision pipelines including object detection, weakly supervised detection, object discovery, tracking, etc. Compared to theExpand
Context-Aware Zero-Shot Recognition
TLDR
This work aims to understand the identity of the novel objects in an image surrounded by the known objects using the inter-object relation prior, and leverage the visual context and the geometric relationships between all pairs of objects in a single image, and capture the information useful to infer unseen categories. Expand
Exploring Bottom-Up and Top-Down Cues With Attentive Learning for Webly Supervised Object Detection
TLDR
This work proposes a novel webly supervised object detection (WebSOD) method for novel classes which only requires the web images without further annotations, and combines bottom-up and top-down cues for novel class detection. Expand
Detecting 11K Classes: Large Scale Object Detection Without Fine-Grained Bounding Boxes
  • Hao Yang, Hao-Yu Wu, Hao Chen
  • Computer Science
  • 2019 IEEE/CVF International Conference on Computer Vision (ICCV)
  • 2019
TLDR
This paper proposes a semi-supervised large scale fine-grained detection method, which only needs bounding box annotations of a smaller number of coarse- grained classes and image-level labels of large scalefine-grains classes, and can detect all classes at nearly fully-super supervised accuracy. Expand
Context for Object Detection via Lightweight Global and Mid-level Representations
  • Mesut Erhan Unal
  • Computer Science
  • 2020 25th International Conference on Pattern Recognition (ICPR)
  • 2021
TLDR
This work proposes an approach for explicitly capturing context in object detection that significantly outperforms Faster R-CNN, and performs competitively with a much more costly approach that also models context. Expand
Coarse-to-Fine Adaptive People Detection for Video Sequences by Maximizing Mutual Information †
TLDR
A coarse-to-fine framework to adapt frame by frame people detectors during runtime classification, without requiring any additional manually labeled ground truth apart from the offline training of the detection model, is proposed. Expand
Improving Visual Reasoning by Exploiting The Knowledge in Texts
TLDR
A transformerbased model that creates structured knowledge from textual input is proposed that enables the utilization of the knowledge in texts and can achieve ∼8x more accuracte results in scene graph classification, ∼3x in object classification, and ∼1.5x in predicate classification. Expand
Knowledge and Spatial Pyramid Distance-Based Gated Graph Attention Network for Remote Sensing Semantic Segmentation
TLDR
This model first uses superpixels (geographical objects) to form the nodes of a graph neural network and then uses a novel spatial pyramid distance recognition algorithm to recognize the spatial relationships and a multi-source attention mechanism and gating mechanism are designed to control the process of node aggregation. Expand
Multi-Level Knowledge Injecting for Visual Commonsense Reasoning
  • Zhang Wen, Yuxin Peng
  • Computer Science
  • IEEE Transactions on Circuits and Systems for Video Technology
  • 2021
TLDR
This work proposes Commonsense Knowledge based Reasoning Model (CKRM), a knowledge based reasoning approach, which can relate the transferred knowledge to visual content and compose the reasoning cues to derive the final answer on the challenging visual commonsense reasoning dataset VCR. Expand
Transfer Learning with Deep Recurrent Neural Networks for Remaining Useful Life Estimation
Ansi Zhang 1,2, Honglei Wang 3, Shaobo Li 1,4,* , Yuxin Cui 2, Zhonghao Liu 2 , Guanci Yang 1 and Jianjun Hu 2,4,* 1 Key Laboratory of Advanced Manufacturing Technology of Ministry of Education,Expand
...
1
2
...

References

SHOWING 1-10 OF 49 REFERENCES
Large Scale Semi-Supervised Object Detection Using Visual and Semantic Knowledge Transfer
TLDR
Strong evidence is found that visual similarity and semantic relatedness are complementary for the task, and when combined notably improve detection, achieving state-of-the-art detection performance in a semi-supervised setting. Expand
Object-Graphs for Context-Aware Visual Category Discovery
  • Yong Jae Lee, K. Grauman
  • Computer Science, Medicine
  • IEEE Transactions on Pattern Analysis and Machine Intelligence
  • 2012
TLDR
This work introduces two variants of a novel object-graph descriptor to encode the 2D and 3D spatial layout of object-level co--occurrence patterns relative to an unfamiliar region and shows that by using them to model the interaction between an image's known and unknown objects, it can better detect new visual categories. Expand
Object Detection Meets Knowledge Graphs
TLDR
A novel framework of knowledge-aware object detection is proposed, which enables the integration of external knowledge such as knowledge graphs into any object detection algorithm, which improves object detection through a re-optimization process to achieve better consistency with background knowledge. Expand
Object-Centric Spatial Pooling for Image Classification
TLDR
A framework that learns object detectors using only image-level class labels, or so-called weak labels is proposed, comparable in accuracy with state-of-the-art weakly supervised detection methods and significantly outperforms SPM-based pooling in image classification. Expand
Transfer Learning by Ranking for Weakly Supervised Object Annotation
TLDR
A novel transfer learning based on learning to rank is formulated, which effectively transfers a model for automatic annotation of object location from an auxiliary dataset to a target dataset with completely unrelated object categories. Expand
Microsoft COCO: Common Objects in Context
We present a new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of sceneExpand
LSDA: Large Scale Detection through Adaptation
TLDR
This paper proposes Large Scale Detection through Adaptation (LSDA), an algorithm which learns the difference between the two tasks and transfers this knowledge to classifiers for categories without bounding box annotated data, turning them into detectors. Expand
Object Classification with Adaptable Regions
TLDR
This paper proposes a new latent SVM model for category level object classification that combines spatial and co-occurrence relations between adjacent regions, such that unlikely configurations are penalized and semantic representation can be exploited for finding similar content. Expand
Detector discovery in the wild: Joint multiple instance and representation learning
TLDR
This work provides a novel formulation of a joint multiple instance learning method that includes examples from classification-style data when available, and also performs domain transfer learning to improve the underlying detector representation. Expand
The Role of Context for Object Detection and Semantic Segmentation in the Wild
TLDR
A novel deformable part-based model is proposed, which exploits both local context around each candidate detection as well as global context at the level of the scene, which significantly helps in detecting objects at all scales. Expand
...
1
2
3
4
5
...