The “Something Something” Video Database for Learning and Evaluating Visual Common Sense

@inproceedings{Goyal2017TheS,
  title={The “Something Something” Video Database for Learning and Evaluating Visual Common Sense},
  author={Raghav Goyal and Samira Ebrahimi Kahou and Vincent Michalski and Joanna Materzynska and Susanne Westphal and Heuna Kim and Valentin Haenel and Ingo Fr{\"u}nd and Peter Yianilos and Moritz Mueller-Freitag and Florian Hoppe and Christian Thurau and Ingo Bax and Roland Memisevic},
  booktitle={2017 IEEE International Conference on Computer Vision (ICCV)},
  year={2017},
  pages={5843--5851}
}
Neural networks trained on datasets such as ImageNet have led to major advances in visual object classification. One obstacle that prevents networks from reasoning more deeply about complex scenes and situations, and from integrating visual knowledge with natural language as humans do, is their lack of common-sense knowledge about the physical world. Videos, unlike still images, contain a wealth of detailed information about the physical world. However, most labelled video datasets represent…