Deep Learning Face Attributes in the Wild
- Ziwei Liu, Ping Luo, Xiaogang Wang, Xiaoou Tang
- Computer ScienceIEEE International Conference on Computer Vision
- 27 November 2014
A novel deep learning framework for attribute prediction in the wild that cascades two CNNs, LNet and ANet, which are fine-tuned jointly with attribute tags, but pre-trained differently.
DeepFashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations
- Ziwei Liu, Ping Luo, Shi Qiu, Xiaogang Wang, Xiaoou Tang
- Computer ScienceComputer Vision and Pattern Recognition
- 27 June 2016
This work introduces DeepFashion1, a large-scale clothes dataset with comprehensive annotations, and proposes a new deep model, namely FashionNet, which learns clothing features by jointly predicting clothing attributes and landmarks.
WIDER FACE: A Face Detection Benchmark
- Shuo Yang, Ping Luo, Chen Change Loy, Xiaoou Tang
- Computer ScienceComputer Vision and Pattern Recognition
- 20 November 2015
There is a gap between current face detection performance and the real world requirements, and the WIDER FACE dataset, which is 10 times larger than existing datasets is introduced, which contains rich annotations, including occlusions, poses, event categories, and face bounding boxes.
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers
- Enze Xie, Wenhai Wang, Zhiding Yu, Anima Anandkumar, J. Álvarez, Ping Luo
- Computer ScienceNeural Information Processing Systems
- 31 May 2021
SegFormer is presented, a simple, efficient yet powerful semantic segmentation framework which unifies Transformers with lightweight multilayer perceptron (MLP) decoders and shows excellent zero-shot robustness on Cityscapes-C.
A large-scale car dataset for fine-grained categorization and verification
- L. Yang, Ping Luo, Chen Change Loy, Xiaoou Tang
- Computer ScienceComputer Vision and Pattern Recognition
- 7 June 2015
This paper presents an on-going effort in collecting a large-scale dataset, “CompCars”, that covers not only different car views, but also their different internal and external parts, and rich attributes, and demonstrates a few important applications exploiting the dataset.
Spatial As Deep: Spatial CNN for Traffic Scene Understanding
- Xingang Pan, Jianping Shi, Ping Luo, Xiaogang Wang, Xiaoou Tang
- Computer ScienceAAAI Conference on Artificial Intelligence
- 17 December 2017
This paper proposes Spatial CNN (SCNN), which generalizes traditional deep layer- by-layer convolutions to slice-by-slice convolutions within feature maps, thus enabling message passings between pixels across rows and columns in a layer.
Facial Landmark Detection by Deep Multi-task Learning
- Zhanpeng Zhang, Ping Luo, Chen Change Loy, Xiaoou Tang
- Computer ScienceEuropean Conference on Computer Vision
- 6 September 2014
A novel tasks-constrained deep model is formulated, with task-wise early stopping to facilitate learning convergence and reduces model complexity drastically compared to the state-of-the-art method based on cascaded deep model.
MaskGAN: Towards Diverse and Interactive Facial Image Manipulation
- Cheng-Han Lee, Ziwei Liu, Lingyun Wu, Ping Luo
- Computer ScienceComputer Vision and Pattern Recognition
- 27 July 2019
This work proposes a novel framework termed MaskGAN, enabling diverse and interactive face manipulation, and finds that semantic masks serve as a suitable intermediate representation for flexible face manipulation with fidelity preservation.
Pedestrian Attribute Recognition At Far Distance
- Yubin Deng, Ping Luo, Chen Change Loy, Xiaoou Tang
- Computer ScienceACM Multimedia
- 3 November 2014
A new pedestrian attribute dataset is released, which is by far the largest and most diverse of its kind and it is shown that the large-scale dataset facilitates the learning of robust attribute detectors with good generalization performance.
Sparse R-CNN: End-to-End Object Detection with Learnable Proposals
- Pei Sun, Rufeng Zhang, Ping Luo
- Computer ScienceComputer Vision and Pattern Recognition
- 25 November 2020
Sparse R-CNN demonstrates accuracy, run-time and training convergence performance on par with the well-established detector baselines on the challenging COCO dataset, e.g., achieving 45.0 AP in standard 3× training schedule and running at 22 fps using ResNet-50 FPN model.
...
...