TVGraz : Multi-Modal Learning of Object Categories by Combining Textual and Visual Features

@inproceedings{Khan2009TVGrazM,
  title={TVGraz : Multi-Modal Learning of Object Categories by Combining Textual and Visual Features},
  author={Inayatullah Khan and Amir Saffari and Horst Bischof},
  year={2009}
}
Internet offers a vast amount of multi-modal and heterogeneous information mainly in the form of textual and visual data. Most of the current web-based visual object classification methods only utilize one of these data streams. As we will show in this paper, combining these modalities in a proper way often provides better results not attainable by relying on only one of these data streams. However, up to our knowledge, there is no publicly available dataset for benchmarking algorithms which… CONTINUE READING
Highly Cited
This paper has 25 citations. REVIEW CITATIONS

Citations

Publications citing this paper.
Showing 1-10 of 17 extracted citations

References

Publications referenced by this paper.
Showing 1-10 of 23 references

, Stèphane Canu , and Yves Grandvalet . SimpleMKL

  • Alain Rakotomamonjy, Francis R. Bach
  • Journal of Machine Learning Research
  • 2008

The Caltech-256

  • G. Griffin, A. Holub, P. Perona
  • Technical report, California Institute of…
  • 2007
1 Excerpt

Forsyth . Animals on the web

  • Tamara L. Berg, A. David
  • Proceedings of the International Conference on…
  • 2006

Similar Papers

Loading similar papers…