A Key Volume Mining Deep Framework for Action Recognition

@article{Zhu2016AKV,
  title={A Key Volume Mining Deep Framework for Action Recognition},
  author={Wangjiang Zhu and Jie Hu and Gang Sun and Xudong Cao and Yu Qiao},
  journal={2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
  year={2016},
  pages={1991-1999}
}
Recently, deep learning approaches have demonstrated remarkable progresses for action recognition in videos. Most existing deep frameworks equally treat every volume i.e. spatial-temporal video clip, and directly assign a video label to all volumes sampled from it. However, within a video, discriminative actions may occur sparsely in a few key volumes, and most other volumes are irrelevant to the labeled action category. Training with a large proportion of irrelevant volumes will hurt… CONTINUE READING

Citations

Publications citing this paper.
SHOWING 1-10 OF 90 CITATIONS, ESTIMATED 48% COVERAGE

90 Citations

020402016201720182019
Citations per Year
Semantic Scholar estimates that this publication has 90 citations based on the available data.

See our FAQ for additional information.

References

Publications referenced by this paper.
SHOWING 1-10 OF 36 REFERENCES

C3D: generic features for video analysis

  • D. Tran, L. D. Bourdev, R. Fergus, L. Torresani, M. Paluri
  • CoRR, abs/1412.0767
  • 2014
Highly Influential
9 Excerpts

Similar Papers

Loading similar papers…