Xu Guo | Semantic Scholar

SUGAR: Learning Skeleton Representation with Visual-Motion Knowledge for Action Recognition

Computer Science

13 November 2025

This paper introduces a novel paradigm named learning Skeleton representation with visUal-motion knowledGe for Action Recognition (SUGAR), and proposes to supervise skeleton learning through this prior knowledge to yield discrete representations.

arXiv

YOLOX-B: A Better Yolox Model for Real-Time Driver Behavior Detection

Xu GuoMing MaJiaqiang ZhangShaojie Li

Computer Science, Engineering

IEEE International Conference on Acoustics…

4 June 2023

A new model YOLOX-B is proposed, which introduces a serialized atrous spatial pyramid pooling structure (S-ASPP), obtains different sizes of receptive field information through serialized Atrous Convolution, solves the problem of information loss in max-pooling, and maximizes the efficiency of atrous convolution.

IEEE

IFF-Net: I-Frame Fusion Network for Compressed Video Action Recognition

Shaojie LiJinxin GuoJiaqiang ZhangXu GuoMing Ma

Computer Science, Engineering

IEEE International Conference on Systems, Man and…

1 October 2023

This work proposes a Time Domain Fusion (TDF) Module that can extract both low-frequency and high-frequency components from the video and integrate them seamlessly, resulting in the effective integration of abundant motion information into a single frame.

IEEE