Snap-and-ask: answering multimodal question by naming visual instance

@inproceedings{Zhang2012SnapandaskAM,
  title={Snap-and-ask: answering multimodal question by naming visual instance},
  author={Wei Zhang and Lei Pang and Chong-Wah Ngo},
  booktitle={ACM Multimedia},
  year={2012}
}
In real-life, it is easier to provide a visual cue when asking a question about a possibly unfamiliar topic, for example, asking the question, "Where was this crop circle found?". Providing an image of the instance is far more convenient than texting a verbose description of the visual properties, especially when the name of the query instance is not known. Nevertheless, having to identify the visual instance before processing the question and eventually returning the answer makes multimodal… CONTINUE READING

Similar Papers

Citations

Publications citing this paper.
SHOWING 1-10 OF 15 CITATIONS

Visual instance mining from the graph perspective

VIEW 6 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

Graph-based visual instance mining with geometric matching and nearest candidates selection

  • 2017 9th International Conference on Knowledge and Systems Engineering (KSE)
  • 2017
VIEW 3 EXCERPTS
CITES METHODS & BACKGROUND

Second-Order Configuration of Local Features for Geometrically Stable Image Matching and Retrieval

  • IEEE Transactions on Circuits and Systems for Video Technology
  • 2015
VIEW 2 EXCERPTS
CITES METHODS

Topological Spatial Verification for Instance Search

  • IEEE Transactions on Multimedia
  • 2015
VIEW 2 EXCERPTS
CITES METHODS & BACKGROUND

References

Publications referenced by this paper.
SHOWING 1-5 OF 5 REFERENCES

Improving Bag-of-Features for Large Scale Image Search

  • International Journal of Computer Vision
  • 2009
VIEW 14 EXCERPTS
HIGHLY INFLUENTIAL

On the Annotation of Web Videos by Efficient Near-Duplicate Search

  • IEEE Transactions on Multimedia
  • 2010
VIEW 11 EXCERPTS
HIGHLY INFLUENTIAL

Evaluation campaigns and TRECVid

  • Multimedia Information Retrieval
  • 2006
VIEW 6 EXCERPTS
HIGHLY INFLUENTIAL

Object retrieval with large vocabularies and fast spatial matching

  • 2007 IEEE Conference on Computer Vision and Pattern Recognition
  • 2007
VIEW 3 EXCERPTS
HIGHLY INFLUENTIAL