Context Encoders: Feature Learning by Inpainting
- Deepak Pathak, Philipp Krähenbühl, Jeff Donahue, Trevor Darrell, Alexei A. Efros
- Computer ScienceComputer Vision and Pattern Recognition
- 25 April 2016
It is found that a context encoder learns a representation that captures not just appearance but also the semantics of visual structures, and can be used for semantic inpainting tasks, either stand-alone or as initialization for non-parametric methods.
Curiosity-Driven Exploration by Self-Supervised Prediction
- Deepak Pathak, Pulkit Agrawal, Alexei A. Efros, Trevor Darrell
- Computer ScienceIEEE Conference on Computer Vision and Pattern…
- 15 May 2017
This work forms curiosity as the error in an agent's ability to predict the consequence of its own actions in a visual feature space learned by a self-supervised inverse dynamics model, which scales to high-dimensional continuous state spaces like images, bypasses the difficulties of directly predicting pixels, and ignores the aspects of the environment that cannot affect the agent.
Toward Multimodal Image-to-Image Translation
- Jun-Yan Zhu, Richard Zhang, Eli Shechtman
- Computer ScienceNIPS
- 1 November 2017
This work aims to model a distribution of possible outputs in a conditional generative modeling setting that helps prevent a many-to-one mapping from the latent code to the output during training, also known as the problem of mode collapse.
Large-Scale Study of Curiosity-Driven Learning
- Yuri Burda, Harrison Edwards, Deepak Pathak, A. Storkey, Trevor Darrell, Alexei A. Efros
- Computer ScienceInternational Conference on Learning…
- 13 August 2018
This paper performs the first large-scale study of purely curiosity-driven learning, i.e. without any extrinsic rewards, across 54 standard benchmark environments, including the Atari game suite, and shows surprisingly good performance.
Constrained Convolutional Neural Networks for Weakly Supervised Segmentation
- Deepak Pathak, Philipp Krähenbühl, Trevor Darrell
- Computer ScienceIEEE International Conference on Computer Vision
- 11 June 2015
This work proposes Constrained CNN (CCNN), a method which uses a novel loss function to optimize for any set of linear constraints on the output space of a CNN, and demonstrates the generality of this new learning framework.
Self-Supervised Exploration via Disagreement
- Deepak Pathak, Dhiraj Gandhi, A. Gupta
- Computer ScienceInternational Conference on Machine Learning
- 24 May 2019
This paper proposes a formulation for exploration inspired by the work in active learning literature and trains an ensemble of dynamics models and incentivizes the agent to explore such that the disagreement of those ensembles is maximized, which results in a sample-efficient exploration.
Learning Features by Watching Objects Move
- Deepak Pathak, Ross B. Girshick, Piotr Dollár, Trevor Darrell, Bharath Hariharan
- Computer ScienceComputer Vision and Pattern Recognition
- 19 December 2016
Inspired by the human visual system, low-level motion-based grouping cues can be used to learn an effective visual representation that significantly outperforms previous unsupervised approaches across multiple settings, especially when training data for the target task is scarce.
Fully Convolutional Multi-Class Multiple Instance Learning
- Deepak Pathak, Evan Shelhamer, Jonathan Long, Trevor Darrell
- Computer ScienceInternational Conference on Learning…
- 22 December 2014
This work proposes a novel MIL formulation of multi-class semantic segmentation learning by a fully convolutional network that exploits the further supervision given by images with multiple labels.
Zero-Shot Visual Imitation
- Deepak Pathak, Parsa Mahmoudieh, Trevor Darrell
- Computer ScienceIEEE/CVF Conference on Computer Vision and…
- 15 February 2018
This workmitating expert demonstration is a powerful mechanism for learning to perform tasks from raw sensory observations by providing multiple demonstrations of a task at training time, and this generates data in the form of observation-action pairs from the agent's point of view.
Planning to Explore via Self-Supervised World Models
- Ramanan Sekar, Oleh Rybkin, Kostas Daniilidis, P. Abbeel, Danijar Hafner, Deepak Pathak
- Computer ScienceInternational Conference on Machine Learning
- 12 May 2020
Without any training supervision or task-specific interaction, Plan2Explore outperforms prior self-supervised exploration methods, and in fact, almost matches the performances oracle which has access to rewards.
...
...