Video Captioning With Attention-Based LSTM and Semantic Consistency

  title={Video Captioning With Attention-Based LSTM and Semantic Consistency},
  author={Lianli Gao and Zhao Guo and Hanwang Zhang and Xing Xu and Heng Tao Shen},
  journal={IEEE Transactions on Multimedia},
Recent progress in using long short-term memory (LSTM) for image captioning has motivated the exploration of their applications for video captioning. By taking a video as a sequence of features, an LSTM model is trained on video-sentence pairs and learns to associate a video to a sentence. However, most existing methods compress an entire video shot or frame into a static representation, without considering attention mechanism which allows for selecting salient features. Furthermore, existing… CONTINUE READING
Highly Cited
This paper has 114 citations. REVIEW CITATIONS


Publications citing this paper.
Showing 1-10 of 56 extracted citations

114 Citations

Citations per Year
Semantic Scholar estimates that this publication has 114 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-10 of 51 references

Similar Papers

Loading similar papers…