Cross and Learn: Cross-Modal Self-Supervision (9 Nov 2018)

Nawid Sayed, Biagio Brattoli, Björn Ommer
In this paper we present a self-supervised method for representation learning that utilizes two different modalities. Based on the observation that cross-modal information carries high semantic meaning, we propose a method to effectively exploit this signal. For our approach we utilize video data, since it is available on a large scale and provides easily accessible modalities given by RGB and optical flow. We demonstrate state-of-the-art performance on highly contested action recognition datasets in…
