Hadoop’s Adolescence: An analysis of Hadoop usage in scientific workloads

Abstract

We analyze Hadoop workloads from three different research clusters from a user-centric perspective. The goal is to better understand data scientists’ use of the system and how well the use of the system matches its design. Our analysis suggests that Hadoop usage is still in its adolescence. We see underuse of Hadoop features, extensions, and tools. We see…

20 Figures and Tables

Statistics

[Chart: Citations per Year, 2013 to 2018]

72 Citations

Semantic Scholar estimates that this publication has 72 citations based on the available data.
