Economical Inversion of Large Text Files

@article{Moffat1992EconomicalIO,
  title={Economical Inversion of Large Text Files},
  author={Alistair Moffat},
  journal={Computing Systems},
  year={1992},
  volume={5},
  pages={125--139}
}
To provide keyword-based access to a large text file it is usually necessary to invert the file and create an inverted index that stores, for each word in the file, the paragraph or sentence numbers in which that word occurs. Inverting a large file using traditional techniques may take as much temporary disk space as is occupied by the file itself, and consume a great deal of CPU time. Here we describe an alternative technique for inverting large text files that requires only a nominal amount of…
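
For readers unfamiliar with the term, the inverted index described in the abstract can be pictured with a small sketch. This is only a naive in-memory illustration of the data structure (each word mapped to the paragraph numbers in which it occurs); it is not the economical technique of the paper, whose details are cut off above. The function name `build_inverted_index`, the blank-line paragraph separator, and the lowercase alphanumeric definition of a word are all assumptions made for this example.

```python
import re
from collections import defaultdict


def build_inverted_index(text):
    """Map each word to the sorted list of paragraph numbers containing it.

    Assumptions for illustration only: paragraphs are separated by blank
    lines, paragraph numbers start at 1, and a "word" is a run of lowercase
    letters or digits after case folding.
    """
    index = defaultdict(set)
    # Split on blank lines and discard empty fragments.
    paragraphs = [p for p in re.split(r"\n\s*\n", text) if p.strip()]
    for number, paragraph in enumerate(paragraphs, start=1):
        for word in re.findall(r"[a-z0-9]+", paragraph.lower()):
            index[word].add(number)
    # Sets remove duplicate occurrences; sorted lists are easier to scan.
    return {word: sorted(numbers) for word, numbers in index.items()}


sample = "the cat sat\n\nthe dog ran\n\na cat ran"
index = build_inverted_index(sample)
print(index["cat"])  # paragraph numbers in which "cat" occurs
```

Because this sketch holds the whole index in memory, it sidesteps the temporary disk space and CPU costs that the abstract identifies as the problem with traditional inversion of large files; it shows what is being built, not how to build it cheaply.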