Efficient Visual Search of Videos Cast as Text Retrieval

  title={Efficient Visual Search of Videos Cast as Text Retrieval},
  author={Josef Sivic and Andrew Zisserman},
  journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
We describe an approach to object retrieval which searches for and localizes all the occurrences of an object in a video, given a query image of the object. The object is represented by a set of viewpoint invariant region descriptors so that recognition can proceed successfully despite changes in viewpoint, illumination and partial occlusion. The temporal continuity of the video within a shot is used to track the regions in order to reject those that are unstable. Efficient retrieval is… CONTINUE READING
Highly Influential
This paper has highly influenced 31 other papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 417 citations. REVIEW CITATIONS


Publications citing this paper.
Showing 1-10 of 231 extracted citations

418 Citations

Citations per Year
Semantic Scholar estimates that this publication has 418 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-10 of 43 references

Similar Papers

Loading similar papers…