Kamil Bajda-Pawlikowski

Learn More
The production environment for analytical data management applications is rapidly changing. Many enterprises are shifting away from deploying their analytical databases on high-end proprietary machines, and moving towards cheaper, lower-end, commodity hardware, typically arranged in a shared-nothing MPP architecture, often in a virtualized environment(More)
Hadapt is a start-up company currently commercializing the Yale University research project called HadoopDB. The company focuses on building a platform for Big Data analytics in the cloud by introducing a storage layer optimized for structured data and by providing a framework for executing SQL queries efficiently. This work considers processing data(More)
HadoopDB is a hybrid of MapReduce and DBMS technologies, designed to meet the growing demand of analyzing massive datasets on very large clusters of machines. Our previous work has shown that HadoopDB approaches parallel databases in performance and still yields the scalability and fault tolerance of MapReduce-based systems. In this demonstration, we focus(More)
Automation of business processes, proliferation of digital devices. eBay has a 6.5 petabyte warehouse. Automation of business processes, proliferation of digital devices. eBay has a 6.5 petabyte warehouse. 2 Deep analysis over raw data: Inefficient to push data from database into specialized analysis engines → process data in the database. Best(More)
  • 1