# Learning Perceptually-Aligned Representations via Adversarial Robustness

@article{Engstrom2019LearningPR,
  title={Learning Perceptually-Aligned Representations via Adversarial Robustness},
  author={L. Engstrom and Andrew Ilyas and Shibani Santurkar and D. Tsipras and B. Tran and A. Madry},
  journal={ArXiv},
  year={2019},
  volume={abs/1906.00945}
}

Many applications of machine learning require models that are human-aligned, i.e., that make decisions based on human-meaningful information about the input. We identify the pervasive brittleness of deep networks' learned representations as a fundamental barrier to attaining this goal. We then re-cast robust optimization as a tool for enforcing human priors on the features learned by deep neural networks. The resulting robust feature representations turn out to be significantly more aligned…

