Corpus ID: 52181738

YouTube-VOS: A Large-Scale Video Object Segmentation Benchmark

@article{Xu2018YouTubeVOSAL,
  title={YouTube-VOS: A Large-Scale Video Object Segmentation Benchmark},
  author={N. Xu and L. Yang and Yuchen Fan and Dingcheng Yue and Yuchen Liang and Jianchao Yang and Thomas S. Huang},
  journal={ArXiv},
  year={2018},
  volume={abs/1809.03327}
}
Learning long-term spatial-temporal features are critical for many video analysis tasks. However, existing video segmentation methods predominantly rely on static image segmentation techniques, and methods capturing temporal dependency for segmentation have to depend on pretrained optical flow models, leading to suboptimal solutions for the problem. End-to-end sequential learning to explore spatialtemporal features for video segmentation is largely limited by the scale of available video… Expand
125 Citations
Video Instance Segmentation
Integrating Long-Short Term Network for Efficient Video Object Segmentation
BubbleNets: Learning to Select the Guidance Frame in Video Object Segmentation by Deep Sorting Frames
CompFeat: Comprehensive Feature Aggregation for Video Instance Segmentation
Siamese Dynamic Mask Estimation Network for Fast Video Object Segmentation
RVOS: End-To-End Recurrent Network for Video Object Segmentation
Learning Fast and Robust Target Models for Video Object Segmentation
Hybrid Sequence to Sequence Model for Video Object Segmentation
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 34 REFERENCES
YouTube-VOS: Sequence-to-Sequence Video Object Segmentation
YouTube-8M: A Large-Scale Video Classification Benchmark
A Unified Video Segmentation Benchmark: Annotation, Metrics and Analysis
Efficient Video Object Segmentation via Network Modulation
A Benchmark Dataset and Evaluation Methodology for Video Object Segmentation
One-Shot Video Object Segmentation
Learning Video Object Segmentation with Visual Memory
Online Adaptation of Convolutional Neural Networks for Video Object Segmentation
Learning Video Object Segmentation from Static Images
MaskRNN: Instance Level Video Object Segmentation
...
1
2
3
4
...