Pig latin: a not-so-foreign language for data processing
- Christopher Olston, B. Reed, U. Srivastava, Ravi Kumar, A. Tomkins
- Computer ScienceSIGMOD Conference
- 9 June 2008
A new language called Pig Latin is described, designed to fit in a sweet spot between the declarative style of SQL, and the low-level, procedural style of map-reduce, which is an open-source, Apache-incubator project, and available for general use.
Distributed top-k monitoring
- Brian Babcock, Christopher Olston
- Computer ScienceACM SIGMOD Conference
- 9 June 2003
This work shows that transmitting entire data streams is unnecessary to support top-k monitoring queries and presents an alternative approach that reduces communication significantly, and empirically through extensive simulation on real-world data that this approach reduces overall communication cost by an order of magnitude.
TensorFlow-Serving: Flexible, High-Performance ML Serving
- Christopher Olston, Noah Fiedel, Jordan Soyke
- Computer SciencearXiv.org
- 17 December 2017
TensorFlow-Serving is described, a system to serve machine learning models inside Google which is also available in the cloud and via open-source, and ways to integrate with systems that convey new models and updated versions from training to serving.
Adaptive filters for continuous queries over distributed data streams
- Christopher Olston, Jing Jiang, J. Widom
- Computer ScienceACM SIGMOD Conference
- 9 June 2003
This work considers an environment where distributed data sources continuously stream updates to a centralized processor that monitors continuous queries over the distributed data, and proposes a new technique for reducing the overhead.
Query Processing, Approximation, and Resource Management in a Data Stream Management System
- R. Motwani, J. Widom, Rohit Varma
- Computer ScienceConference on Innovative Data Systems Research
- 2003
This paper describes our ongoing work developing the Stanford Stream Data Manager (STREAM), a system for executing continuous queries over multiple continuous data streams. The STREAM system supports…
Building a HighLevel Dataflow System on top of MapReduce: The Pig Experience
- Alan Gates, Olga Natkovich, U. Srivastava
- Computer ScienceProceedings of the VLDB Endowment
- 1 August 2009
Pig is a high-level dataflow system that aims at a sweet spot between SQL and Map-Reduce, and performance comparisons between Pig execution and raw Map- Reduce execution are reported.
What's new on the web?: the evolution of the web from a search engine perspective
- A. Ntoulas, Junghoo Cho, Christopher Olston
- Computer ScienceThe Web Conference
- 17 May 2004
The authors' findings indicate a rapid turnover rate of Web pages, i.e., high rates of birth and death, coupled with an even higher rate ofturnover in the hyperlinks that connect them, which is likely to remain consistent over time.
Query Processing, Resource Management, and Approximation ina Data Stream Management System
- R. Motwani, J. Widom, Rohit Varma
- Computer Science
- 2002
This paper describes our ongoing work developing the Stanford Stream Data Manager (STREAM), a system for executing continuous queries over multiple continuous data streams. The STREAM system supports…
Adaptive precision setting for cached approximate values
- Christopher Olston, B. T. Loo, J. Widom
- Computer ScienceACM SIGMOD Conference
- 1 May 2001
A parameterized algorithm for adjusting the precision of cached approximations adaptively to achieve the best performance as data values, precision requirements, or workload vary, which easily outperforms previous algorithms for exact caching.
ScentTrails: Integrating browsing and searching on the Web
- Christopher Olston, Ed H. Chi
- Computer ScienceTCHI
- 1 September 2003
This work introduces a novel approach called ScentTrails, based on the concept of information scent developed in the context of information foraging theory, that enables users to interpolate smoothly between searching and browsing to locate content matching complex information goals effectively.
...
...