Deep Learning Face Attributes in the Wild
- Ziwei Liu, Ping Luo, Xiaogang Wang, Xiaoou Tang
- Computer ScienceIEEE International Conference on Computer Vision
- 27 November 2014
A novel deep learning framework for attribute prediction in the wild that cascades two CNNs, LNet and ANet, which are fine-tuned jointly with attribute tags, but pre-trained differently.
Pyramid Scene Parsing Network
- Hengshuang Zhao, Jianping Shi, Xiaojuan Qi, Xiaogang Wang, Jiaya Jia
- Computer ScienceComputer Vision and Pattern Recognition
- 4 December 2016
This paper exploits the capability of global context information by different-region-based context aggregation through the pyramid pooling module together with the proposed pyramid scene parsing network (PSPNet) to produce good quality results on the scene parsing task.
DeepReID: Deep Filter Pairing Neural Network for Person Re-identification
- Wei Li, Rui Zhao, Tong Xiao, Xiaogang Wang
- Computer ScienceIEEE Conference on Computer Vision and Pattern…
- 23 June 2014
A novel filter pairing neural network (FPNN) to jointly handle misalignment, photometric and geometric transforms, occlusions and background clutter is proposed and significantly outperforms state-of-the-art methods on this dataset.
DeepFashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations
- Ziwei Liu, Ping Luo, Shi Qiu, Xiaogang Wang, Xiaoou Tang
- Computer ScienceComputer Vision and Pattern Recognition
- 27 June 2016
This work introduces DeepFashion1, a large-scale clothes dataset with comprehensive annotations, and proposes a new deep model, namely FashionNet, which learns clothing features by jointly predicting clothing attributes and landmarks.
StackGAN: Text to Photo-Realistic Image Synthesis with Stacked Generative Adversarial Networks
- Han Zhang, Tao Xu, Dimitris N. Metaxas
- Computer ScienceIEEE International Conference on Computer Vision
- 10 December 2016
This paper proposes Stacked Generative Adversarial Networks (StackGAN) to generate 256 photo-realistic images conditioned on text descriptions and introduces a novel Conditioning Augmentation technique that encourages smoothness in the latent conditioning manifold.
Residual Attention Network for Image Classification
- Fei Wang, Mengqing Jiang, Xiaoou Tang
- Computer ScienceComputer Vision and Pattern Recognition
- 23 April 2017
The proposed Residual Attention Network is a convolutional neural network using attention mechanism which can incorporate with state-of-art feed forward network architecture in an end-to-end training fashion and can be easily scaled up to hundreds of layers.
Deep Learning Face Representation by Joint Identification-Verification
- Yi Sun, Yuheng Chen, Xiaogang Wang, Xiaoou Tang
- Computer ScienceNIPS
- 18 June 2014
This paper shows that the face identification-verification task can be well solved with deep learning and using both face identification and verification signals as supervision, and the error rate has been significantly reduced.
Joint Detection and Identification Feature Learning for Person Search
- Tong Xiao, Shuang Li, Bochao Wang, Liang Lin, Xiaogang Wang
- Computer ScienceComputer Vision and Pattern Recognition
- 7 April 2016
A new deep learning framework for person search that jointly handles pedestrian detection and person re-identification in a single convolutional neural network and converges much faster and better than the conventional Softmax loss.
Human Reidentification with Transferred Metric Learning
- Wei Li, Rui Zhao, Xiaogang Wang
- Computer ScienceAsian Conference on Computer Vision
- 5 November 2012
Experiments on the VIPeR dataset and the dataset show that the proposed transferred metric learning significantly outperforms directly matching visual features or using a single generic metric learned from the whole training set.
Spatial As Deep: Spatial CNN for Traffic Scene Understanding
- Xingang Pan, Jianping Shi, Ping Luo, Xiaogang Wang, Xiaoou Tang
- Computer ScienceAAAI Conference on Artificial Intelligence
- 17 December 2017
This paper proposes Spatial CNN (SCNN), which generalizes traditional deep layer- by-layer convolutions to slice-by-slice convolutions within feature maps, thus enabling message passings between pixels across rows and columns in a layer.
...
...