Label-Based Automatic Alignment of Video with Narrative Sentences

@inproceedings{Dogan2016LabelBasedAA,
  title={Label-Based Automatic Alignment of Video with Narrative Sentences},
  author={Pelin Dogan and Markus H. Gross and Jean-Charles Bazin},
  booktitle={ECCV Workshops},
  year={2016}
}
In this paper we consider videos (e.g. Hollywood movies) and their accompanying natural language descriptions in the form of narrative sentences (e.g. movie scripts without timestamps). We propose a method for temporally aligning the video frames with the sentences using both visual and textual information, which provides automatic timestamps for each narrative sentence. We compute the similarity between both types of information using vectorial descriptors and propose to cast this alignment…
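
To make the idea of similarity-driven temporal alignment concrete, here is a minimal Python sketch. The abstract above is truncated before it states how the alignment is formulated, so this does not attempt to reproduce the paper's method. It substitutes a generic technique: cosine similarity between per-frame and per-sentence descriptor vectors, followed by a monotonic dynamic-programming assignment in which sentence indices never decrease over time. The function names, the descriptor shapes, and the toy data are all hypothetical and chosen only for illustration.

```python
import numpy as np


def cosine_similarity_matrix(frames, sentences):
    """Return an (n_frames, n_sentences) matrix of cosine similarities.

    Assumes every descriptor row is non-zero (hypothetical input format).
    """
    f = frames / np.linalg.norm(frames, axis=1, keepdims=True)
    s = sentences / np.linalg.norm(sentences, axis=1, keepdims=True)
    return f @ s.T


def monotonic_alignment(sim):
    """Assign one sentence index to each frame, maximizing total similarity.

    Generic stand-in, not the paper's formulation: the first frame maps to
    the first sentence, the last frame to the last sentence, and the index
    either stays the same or advances by one between consecutive frames.
    """
    n_frames, n_sent = sim.shape
    if n_frames < n_sent:
        raise ValueError("need at least one frame per sentence")

    score = np.full((n_frames, n_sent), -np.inf)
    back = np.zeros((n_frames, n_sent), dtype=int)
    score[0, 0] = sim[0, 0]

    for t in range(1, n_frames):
        for j in range(n_sent):
            stay = score[t - 1, j]
            advance = score[t - 1, j - 1] if j > 0 else -np.inf
            if advance > stay:
                score[t, j] = advance + sim[t, j]
                back[t, j] = j - 1
            else:
                score[t, j] = stay + sim[t, j]
                back[t, j] = j

    # Backtrack from the last frame, which must map to the last sentence.
    labels = np.empty(n_frames, dtype=int)
    j = n_sent - 1
    for t in range(n_frames - 1, -1, -1):
        labels[t] = j
        j = back[t, j]
    return labels


if __name__ == "__main__":
    # Toy descriptors: three frames resemble sentence 0, three resemble sentence 1.
    frames = np.array([[1, 0], [1, 0], [1, 0], [0, 1], [0, 1], [0, 1]], dtype=float)
    sentences = np.array([[1, 0], [0, 1]], dtype=float)
    labels = monotonic_alignment(cosine_similarity_matrix(frames, sentences))
    print(labels.tolist())  # [0, 0, 0, 1, 1, 1]
```

In this sketch, the first and last frame carrying a given sentence index would serve as that sentence's start and end timestamps. The monotonicity constraint reflects the assumption that narrative sentences follow the same order as the events in the video.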