Fast Tweet Retrieval with Compact Binary Codes

Weiwei Guo, Wei Liu, Mona T. Diab
Cosine similarity is arguably the most widely used similarity measure in natural language processing. On Twitter, however, the sheer volume of tweets makes it expensive to compute cosine similarity across very large numbers of data samples. In this paper, we exploit binary coding to tackle this scalability issue: each data sample is compressed into a compact binary code, which enables highly efficient similarity computations via Hamming…
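
To make the idea of comparing compact binary codes concrete, here is a minimal sketch in Python. The abstract does not say how the codes are produced, so the sketch substitutes random hyperplane hashing (sign of a random projection) purely as a stand-in; it is not the authors' coding method. The function names `random_hyperplanes`, `binary_code`, and `hamming_distance` are hypothetical and chosen only for illustration.

```python
import random


def random_hyperplanes(num_bits, dim, seed=0):
    """Draw one random Gaussian hyperplane per output bit.

    Assumption: random projections stand in for whatever coding
    scheme the paper actually uses.
    """
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_bits)]


def binary_code(vector, hyperplanes):
    """Compress a real-valued vector into an integer holding one bit per hyperplane."""
    code = 0
    for plane in hyperplanes:
        dot = sum(p * x for p, x in zip(plane, vector))
        # The bit is 1 when the vector lies on the non-negative side of the hyperplane.
        code = (code << 1) | (1 if dot >= 0 else 0)
    return code


def hamming_distance(code_a, code_b):
    """Count differing bits: XOR the two codes, then count the set bits."""
    return bin(code_a ^ code_b).count("1")


if __name__ == "__main__":
    planes = random_hyperplanes(num_bits=16, dim=4)
    tweet_a = binary_code([1.0, 0.5, 0.0, 2.0], planes)
    tweet_b = binary_code([0.9, 0.6, 0.1, 1.8], planes)
    print(hamming_distance(tweet_a, tweet_b))
```

Once every sample is stored as a short bit string, comparing two samples takes one XOR and one bit count rather than a floating-point dot product over the full feature vector, which is the source of the efficiency the abstract refers to.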

