Comparing inverted files and signature files for searching a large lexicon

@article{Carterette2005ComparingIF,
  title={Comparing inverted files and signature files for searching a large lexicon},
  author={Ben Carterette and Fazli Can},
  journal={Inf. Process. Manage.},
  year={2005},
  volume={41},
  pages={613--633}
}
Signature files and inverted files are well-known index structures. In this paper we undertake a direct comparison of the two for searching for partially-specified queries in a large lexicon stored in main memory. Using n-grams to index lexicon terms, a bit-sliced signature file can be compressed to a smaller size than an inverted file if each n-gram sets only one bit in the term signature. With a signature width less than half the number of unique n-grams in the lexicon, the signature file…
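
To make the two index structures in the abstract concrete, the following is a minimal Python sketch, not the authors' implementation. It indexes a toy lexicon by character bigrams in two ways: as an inverted file (n-gram to set of term ids) and as a bit-sliced signature file in which each n-gram sets exactly one bit of the term signature. All function names, the use of CRC32 as the hash, the `*` wildcard syntax, and the final verification step with `fnmatchcase` are assumptions made for illustration. Compression, which the abstract's size comparison depends on, is not shown.

```python
import zlib
from collections import defaultdict
from fnmatch import fnmatchcase


def ngrams(term, n=2):
    """Set of character n-grams of a string (no padding; an assumption)."""
    return {term[i:i + n] for i in range(len(term) - n + 1)}


def query_ngrams(pattern, n=2):
    """N-grams taken from the literal fragments between '*' wildcards."""
    grams = set()
    for fragment in pattern.split("*"):
        grams |= ngrams(fragment, n)
    return grams


def bit_position(gram, width):
    """Map an n-gram to a single signature bit (CRC32 is a hypothetical choice)."""
    return zlib.crc32(gram.encode("utf-8")) % width


def build_inverted(lexicon, n=2):
    """Inverted file: n-gram -> set of ids of the terms containing it."""
    index = defaultdict(set)
    for term_id, term in enumerate(lexicon):
        for gram in ngrams(term, n):
            index[gram].add(term_id)
    return index


def build_bit_slices(lexicon, width, n=2):
    """Bit-sliced signature file: slices[b] has bit t set when term t's
    signature has bit b set. Each slice is stored as a Python int."""
    slices = [0] * width
    for term_id, term in enumerate(lexicon):
        for gram in ngrams(term, n):
            slices[bit_position(gram, width)] |= 1 << term_id
    return slices


def search_inverted(pattern, lexicon, index, n=2):
    """Intersect posting sets, then verify candidates against the pattern."""
    candidates = set(range(len(lexicon)))
    for gram in query_ngrams(pattern, n):
        candidates &= index.get(gram, set())
    return [lexicon[i] for i in sorted(candidates)
            if fnmatchcase(lexicon[i], pattern)]


def search_signature(pattern, lexicon, slices, n=2):
    """AND the bit slices selected by the query n-grams, then verify.
    Verification removes false matches caused by hash collisions."""
    candidates = (1 << len(lexicon)) - 1
    for gram in query_ngrams(pattern, n):
        candidates &= slices[bit_position(gram, len(slices))]
    return [term for i, term in enumerate(lexicon)
            if (candidates >> i) & 1 and fnmatchcase(term, pattern)]


if __name__ == "__main__":
    lexicon = ["compression", "comparison", "computation",
               "signature", "inverted"]
    index = build_inverted(lexicon)
    slices = build_bit_slices(lexicon, width=8)
    print(search_inverted("comp*ion", lexicon, index))
    print(search_signature("comp*ion", lexicon, slices))
```

In this sketch both searches return the same terms, because every candidate is checked against the pattern before it is reported. The structures differ only in how many false candidates reach that check: a narrow signature width makes more n-grams share a bit, so the signature search verifies more terms than the inverted-file search does.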