Frustratingly Simple Few-Shot Object Detection
- Xin Wang, Thomas E. Huang, Trevor Darrell, Joseph Gonzalez, F. Yu
- Computer ScienceInternational Conference on Machine Learning
- 16 March 2020
This work finds that fine-tuning only the last layer of existing detectors on rare classes is crucial to the few-shot object detection task, and establishes a new state of the art on the revised benchmarks.
Learning Hierarchical Semantic Image Manipulation through Structured Representations
- Seunghoon Hong, Xinchen Yan, Thomas E. Huang, Honglak Lee
- Computer ScienceNeural Information Processing Systems
- 1 August 2018
A novel hierarchical framework for semantic image manipulation that allows a user to manipulate images at object-level by adding, removing, and moving one bounding box at a time is presented.
Revisiting Hierarchical Approach for Persistent Long-Term Video Prediction
- Wonkwang Lee, Whie Jung, Seunghoon Hong
- Computer ScienceInternational Conference on Learning…
- 14 April 2021
This work revisits hierarchical models in video prediction, and shows that modeling structures and their dynamics in the discrete semantic structure space with a stochastic recurrent estimator leads to surprisingly successful long-term prediction.
Robust Object Detection via Instance-Level Temporal Cycle Confusion
- Xin Wang, Thomas E. Huang, Trevor Darrell
- Computer ScienceIEEE International Conference on Computer Vision
- 16 April 2021
This work introduces a novel self-supervised task, instance-level temporal cycle confusion (CycConf), which operates on the region features of the object detectors, and encourages the object detector to explore invariant structures across instances under various motions, which leads to improved model robustness in unseen domains at test time.
Tracking Every Thing in the Wild
- Siyuan Li, Martin Danelljan, Henghui Ding, Thomas E. Huang, F. Yu
- Computer ScienceEuropean Conference on Computer Vision
- 26 July 2022
A new metric, Track Every Thing Accuracy (TETA), is introduced, breaking tracking measurement into three sub-factors: localization, association, and classification, allowing comprehensive benchmarking of tracking performance even under inaccurate classification.
Dense Prediction with Attentive Feature Aggregation
- Yung-Hsu Yang, Thomas E. Huang, S. R. Bulò, P. Kontschieder, F. Yu
- Computer ScienceIEEE Workshop/Winter Conference on Applications…
- 1 November 2021
Attentive Feature Aggregation is introduced to fuse different network layers with more expressive non-linear operations and improves the performance of the Deep Layer Aggregation model by nearly 6% mIoU on Cityscapes.
QDTrack: Quasi-Dense Similarity Learning for Appearance-Only Multiple Object Tracking
- Tobias Fischer, Jiangmiao Pang, F. Yu
- Computer ScienceArXiv
- 12 October 2022
Quasi-Dense Similarity Learning is presented, which densely samples hundreds of object regions on a pair of images for contrastive learning and which rivals the performance of state-of-the-art tracking methods on all benchmarks and sets a new state of theart on the large-scale BDD100K MOT benchmark, while introducing negligible computational overhead to the detector.
Composite Learning for Robust and Effective Dense Predictions
- M. Kanakis, Thomas E. Huang, David Bruggemann, F. Yu, L. Gool
- Computer ScienceIEEE Workshop/Winter Conference on Applications…
- 13 October 2022
It is found that jointly training a dense prediction task with a self-supervised (auxiliary) task can consistently improve the performance of the target task, while eliminating the need for labeling auxiliary tasks.