Large-scale Incremental Processing Using Distributed Transactions and Notifications

@inproceedings{Peng2010LargescaleIP,
  title={Large-scale Incremental Processing Using Distributed Transactions and Notifications},
  author={Daniel Peng and Frank Dabek},
  booktitle={OSDI},
  year={2010}
}
Updating an index of the web as documents are crawled requires continuously transforming a large repository of existing documents as new documents arrive. This task is one example of a class of data processing tasks that transform a large repository of data via small, independent mutations. These tasks lie in a gap between the capabilities of existing infrastructure. Databases do not meet the storage or throughput requirements of these tasks: Google’s indexing system stores tens of petabytes of… CONTINUE READING
Highly Influential
This paper has highly influenced 46 other papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 484 citations. REVIEW CITATIONS

9 Figures & Tables

Topics

Statistics

050100201020112012201320142015201620172018
Citations per Year

484 Citations

Semantic Scholar estimates that this publication has 484 citations based on the available data.

See our FAQ for additional information.