Leveraging a scalable row store to build a distributed text index

@inproceedings{Li2009LeveragingAS,
  title={Leveraging a scalable row store to build a distributed text index},
  author={Ning Li and Jun Rao and Eugene J. Shekita and Sandeep Tata},
  booktitle={CloudDb},
  year={2009}
}
Many content-oriented applications require a scalable text index. Building such an index is challenging. In addition to the logic of inserting and searching documents, developers have to worry about issues in a typical distributed environment, such as fault tolerance, incrementally growing the index cluster, and load balancing. We developed a distributed text index called HIndex, by judiciously exploiting the control layer of HBase, which is an open source implementation of Google's Bigtable… CONTINUE READING

References

Publications referenced by this paper.
Showing 1-3 of 3 references

4. search 1 index tablet (queries/sec) 5. search 1 index tablet on gpfs (queries/sec) with filter w/o filter cold

  • Brian F. Cooper, Raghu Ramakrishnan, +6 authors Ramana
  • 2008
Highly Influential
7 Excerpts

Ghemawat: MapReduce: Simplified Data

  • Jeffrey Dean, Sanjay
  • Processing on Large Clusters,
  • 2004
Highly Influential
3 Excerpts

Similar Papers

Loading similar papers…