SUGAR: Learning Skeleton Representation with Visual-Motion Knowledge for Action Recognition
This paper introduces a novel paradigm named learning Skeleton representation with visUal-motion knowledGe for Action Recognition (SUGAR), and proposes to supervise skeleton learning through this prior knowledge to yield discrete representations.
YOLOX-B: A Better Yolox Model for Real-Time Driver Behavior Detection
- Xu GuoMing MaJiaqiang ZhangShaojie Li
- 4 June 2023
Computer Science, Engineering
A new model YOLOX-B is proposed, which introduces a serialized atrous spatial pyramid pooling structure (S-ASPP), obtains different sizes of receptive field information through serialized Atrous Convolution, solves the problem of information loss in max-pooling, and maximizes the efficiency of atrous convolution.
IFF-Net: I-Frame Fusion Network for Compressed Video Action Recognition
- Shaojie LiJinxin GuoJiaqiang ZhangXu GuoMing Ma
- 1 October 2023
Computer Science, Engineering
This work proposes a Time Domain Fusion (TDF) Module that can extract both low-frequency and high-frequency components from the video and integrate them seamlessly, resulting in the effective integration of abundant motion information into a single frame.