Leveraging a scalable row store to build a distributed text index

  title={Leveraging a scalable row store to build a distributed text index},
  author={Ning Li and Jun Rao and Eugene J. Shekita and Sandeep Tata},
Many content-oriented applications require a scalable text index. Building such an index is challenging. In addition to the logic of inserting and searching documents, developers have to worry about issues in a typical distributed environment, such as fault tolerance, incrementally growing the index cluster, and load balancing. We developed a distributed text index called HIndex, by judiciously exploiting the control layer of HBase, which is an open source implementation of Google's Bigtable… CONTINUE READING


Publications referenced by this paper.
Showing 1-3 of 3 references

4. search 1 index tablet (queries/sec) 5. search 1 index tablet on gpfs (queries/sec) with filter w/o filter cold

  • Brian F. Cooper, Raghu Ramakrishnan, +6 authors Ramana
  • 2008
Highly Influential
7 Excerpts

Ghemawat: MapReduce: Simplified Data

  • Jeffrey Dean, Sanjay
  • Processing on Large Clusters,
  • 2004
Highly Influential
3 Excerpts

Similar Papers

Loading similar papers…