A deep semantic framework for multimodal representation learning

@article{Wang2016ADS,
  title={A deep semantic framework for multimodal representation learning},
  author={Cheng Wang and Haojin Yang and Christoph Meinel},
  journal={Multimedia Tools and Applications},
  year={2016},
  volume={75},
  pages={9255-9276}
}
Multimodal representation learning has gained increasing importance in various real-world multimedia applications. Most previous approaches focused on exploring inter-modal correlation by learning a common or intermediate space in a conventional way, e.g. Canonical Correlation Analysis (CCA). These works neglected the exploration of fusing multiple modalities at higher semantic level. In this paper, inspired by the success of deep networks in multimedia computing, we propose a novel unified… CONTINUE READING
Highly Cited
This paper has 20 citations. REVIEW CITATIONS

14 Figures & Tables

Topics

Statistics

0102020172018
Citations per Year

Citation Velocity: 12

Averaging 12 citations per year over the last 2 years.

Learn more about how we calculate this metric in our FAQ.