Analyzing massive astrophysical datasets: Can Pig/Hadoop or a relational DBMS help?

  title={Analyzing massive astrophysical datasets: Can Pig/Hadoop or a relational DBMS help?},
  author={Sarah Loebman and Dylan Nunley and YongChul Kwon and Bill Howe and Magdalena Balazinska and Jeffrey P. Gardner},
  journal={2009 IEEE International Conference on Cluster Computing and Workshops},
As the datasets used to fuel modern scientific discovery grow increasingly large, they become increasingly difficult to manage using conventional software. Parallel database management systems (DBMSs) and massive-scale data processing systems such as MapReduce hold promise to address this challenge. However, since these systems have not been expressly designed for scientific applications, their efficacy in this domain has not been thoroughly tested. In this paper, we study the performance of… CONTINUE READING
Highly Cited
This paper has 64 citations. REVIEW CITATIONS
42 Citations
32 References
Similar Papers


Publications citing this paper.
Showing 1-10 of 42 extracted citations

64 Citations

Citations per Year
Semantic Scholar estimates that this publication has 64 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-10 of 32 references

A comparison of approaches to large-scale data analysis

  • Pavlo A. et. al.
  • Proc. of the SIGMOD Conf., 2009.
  • 2009
2 Excerpts

An Architecture for Recycling Intermediates in a Columnstore Prototyping Bubba , a highly parallel database system

  • M. L. Kersten
  • Proc . of the SIGMOD Conf .
  • 2009

Similar Papers

Loading similar papers…