Database engines often consume significant power during query processing activities, motivating researchers to investigate the redesign of their internals to minimize these overheads. While the prior literature has dealt exclusively with average power considerations, our focus here is on peak power consumption. We begin by profiling the peak power behavior… (More)
This paper develops a linguistically robust Linguistic steganography approach using synonym replacement, which converts a message into semantically innocuous text. Drawing upon linguistic criteria, this approach uses word replacement, with substitution classes based on traditional word replacement features (syntactic categories and subcategories), as well… (More)
Following the 'no one size fits all' philosophy, active research in big data platforms is focusing on creating an environment for multiple 'one-size' systems to co-exist and cooperate in the same cluster. Consequently, it has now become imperative to provide an integrated management solution that provides a database-centric view of the underlying… (More)
Systems for processing big data—e.g., Hadoop, Spark, and massively parallel databases—need to run workloads on behalf of multiple tenants simultaneously. The abundant disk-based storage in these systems is usually complemented by a smaller, but much faster, cache. Cache is a precious resource: Tenants who get to use cache can see two orders of magnitude… (More)
In many algorithms in the field of data mining to perform clustering of given data, notion of 'cluster similarity' is used to a large extent. Given a set of clusters, we may want to merge them to form a smaller number of clusters; particular applications being Hierarchical clustering algorithms and ideas like SPARCL which is the motivation behind this… (More)
1. INTRODUCTION Data analytics clusters are typically set up with a centralized resource manager(RM), e.g. Yarn , Mesos . Each application scheduled on the cluster negotiates with the RM for resources. A unit of resources allocated to an application is termed as a container, the size of which is specified by the application at the time of… (More)
Acknowledgements I would like to thank my advisor Prof. Jayant Haritsa for allowing me to pursue this project under his guidance. I thank him, for his constant and valuable guidance, support and encouragement during my stay at IISc. I would also like to thank my project partner Mahesh Bale and other members of Database Systems Lab for providing a… (More)
Problem Definition Study the performance of database queries on multi-core processor by measuring last level cache misses and then analyze the effect of using HelperCore approach on the performance.