• Publications
  • Influence
High Performance Visual Tracking with Siamese Region Proposal Network
The Siamese region proposal network (Siamese-RPN) is proposed which is end-to-end trained off-line with large-scale image pairs for visual object tracking and consists of SiAMESe subnetwork for feature extraction and region proposal subnetwork including the classification branch and regression branch. Expand
SiamRPN++: Evolution of Siamese Visual Tracking With Very Deep Networks
This work proves the core reason Siamese trackers still have accuracy gap comes from the lack of strict translation invariance, and proposes a new model architecture to perform depth-wise and layer-wise aggregations, which not only improves the accuracy but also reduces the model size. Expand
A face antispoofing database with diverse attacks
A face antispoofing database which covers a diverse range of potential attack variations, and a baseline algorithm is given for comparison, which explores the high frequency information in the facial region to determine the liveness. Expand
Distractor-aware Siamese Networks for Visual Object Tracking
This paper focuses on learning distractor-aware Siamese networks for accurate and long-term tracking, and extends the proposed approach for long- term tracking by introducing a simple yet effective local-to-global search region strategy. Expand
Spindle Net: Person Re-identification with Human Body Region Guided Feature Decomposition and Fusion
This study proposes a novel Convolutional Neural Network, called Spindle Net, based on human body region guided multi-stage feature decomposition and tree-structured competitive feature fusion, which is the first time human body structure information is considered in a CNN framework to facilitate feature learning. Expand
HydraPlus-Net: Attentive Deep Features for Pedestrian Analysis
A new attentionbased deep neural network, named as HydraPlus-Net (HPnet), that multi-directionally feeds the multi-level attention maps to different feature layers to enrich the final feature representations for a pedestrian image. Expand
Deep Cocktail Network: Multi-source Unsupervised Domain Adaptation with Category Shift
This paper proposes a deep cocktail network (DCTN) to battle the domain and category shifts among multiple sources and evaluates DCTN in three domain adaptation benchmarks, which clearly demonstrate the superiority of the framework. Expand
High-fidelity Pose and Expression Normalization for face recognition in the wild
A High-fidelity Pose and Expression Normalization (HPEN) method with 3D Morphable Model (3DMM) which can automatically generate a natural face image in frontal pose and neutral expression and an inpainting method based on Possion Editing to fill the invisible region caused by self occlusion is proposed. Expand
FOTS: Fast Oriented Text Spotting with a Unified Network
This work proposes a unified end-to-end trainable Fast Oriented Text Spotting (FOTS) network for simultaneous detection and recognition, sharing computation and visual information among the two complementary tasks, and introduces RoIRotate to share convolutional features between detection and Recognition. Expand
Quality Aware Network for Set to Set Recognition
  • Yu Liu, Junjie Yan, Wanli Ouyang
  • Computer Science
  • IEEE Conference on Computer Vision and Pattern…
  • 11 April 2017
Analysis on gradient spread of this mechanism indicates that the quality learned by the network is beneficial to set-to-set recognition and simplifies the distribution that the network needs to fit. Expand