Stochastic Data Acquisition for Answering Queries as Time Goes by
@article{Li2016StochasticDA, title={Stochastic Data Acquisition for Answering Queries as Time Goes by}, author={Zheng Li and Tingjian Ge}, journal={Proc. VLDB Endow.}, year={2016}, volume={10}, pages={277-288} }
Data and actions are tightly coupled. On one hand, data analysis results trigger decision making and actions. On the other hand, acquiring data is the very first step in the whole data processing pipeline. Data acquisition almost always incurs some cost, which may be monetary or a computing resource cost such as sensor battery power, network transfers, or I/O. Using outdated data to answer queries can avoid the data acquisition costs, but there is a penalty of…
3 Citations
In-Database Machine Learning with SQL on GPUs
- Computer Science · SSDBM
- 2021
This work demonstrates that SQL with recursive tables makes it possible to express a complete machine learning pipeline comprising data preprocessing, model training, and validation. It also fine-tunes GPU kernels at the hardware level to allow a higher throughput and proposes non-blocking synchronisation of multiple units.
Cost-efficient Data Acquisition on Online Data Marketplaces for Correlation Analysis
- Computer Science · Proc. VLDB Endow.
- 2018
The search problem is proved to be NP-hard, and a heuristic data acquisition algorithm based on Markov chain Monte Carlo (MCMC) is designed and shown to be efficient and effective.
Directions in Blockchain Data Management and Analytics
- Computer Science
- 2018
Several open topics are discussed that researchers could increase focus on to leverage existing capabilities of mature data and information systems, enhance data security and privacy assurances, enable analytics services on blockchain as well as across off-chain data, and make blockchain-based systems active-oriented and intelligent.
References
Toward practical query pricing with QueryMarket
- Computer Science · SIGMOD '13
- 2013
This work develops a new pricing system, QueryMarket, for flexible query pricing in a data market based on an earlier theoretical framework and shows how to use an Integer Linear Programming formulation of the pricing problem for a large class of queries, even when pricing is computationally hard.
Determining the currency of data
- Computer Science · TODS
- 2011
A model that specifies partial currency orders in terms of simple constraints is proposed, which allows us to express what values are copied from other data sources, bearing currency orders in those sources, in terms of copy functions defined on correlated attributes.
Mining of Massive Datasets
- Computer Science
- 2014
Determining relevant data is key to delivering value from massive amounts of data, and big data is defined less by volume, which is a constantly moving target, than by its ever-increasing variety, velocity, variability, and complexity.
Computing the median with uncertainty
- Computer Science, Mathematics · STOC '00
- 2000
A new model for computing with uncertainty is considered and the goal is to pin down the value of f to within a precision p at a minimum possible cost.
Adaptive precision setting for cached approximate values
- Computer Science · SIGMOD '01
- 2001
A parameterized algorithm for adjusting the precision of cached approximations adaptively to achieve the best performance as data values, precision requirements, or workload vary, which easily outperforms previous algorithms for exact caching.
Mining of Massive Datasets
- Computer Science
- 2011
This book focuses on practical algorithms that have been used to solve key problems in data mining and which can be used on even the largest datasets, and explains the tricks of locality-sensitive hashing and stream processing algorithms for mining data that arrives too fast for exhaustive processing.
Challenges and Opportunities with Big Data
- Computer Science · Proc. VLDB Endow.
- 2012
The controversies and myths surrounding Big Data are explored, with the aim of debunking the myths.
Planning and Acting in Partially Observable Stochastic Domains
- Mathematics · Artif. Intell.
- 1998
Artificial Intelligence: A Modern Approach
- Computer Science
- 1995
The long-anticipated revision of this #1 selling book offers the most comprehensive, state of the art introduction to the theory and practice of artificial intelligence for modern applications.…