BlueDBM: Distributed Flash Storage for Big Data Analytics

@article{Jun2016BlueDBMDF,
  title={BlueDBM: Distributed Flash Storage for Big Data Analytics},
  author={Sang Woo Jun and Ming Liu and Sungjin Lee and Jamey Hicks and John Ankcorn and Myron King and Shuotao Xu and Arvind},
  journal={ACM Trans. Comput. Syst.},
  year={2016},
  volume={34},
  pages={7:1-7:31}
}
Complex data queries, because of their need for random accesses, have proven to be slow unless all the data can be accommodated in DRAM. There are many domains, such as genomics, geological data, and daily Twitter feeds, where the datasets of interest are 5TB to 20TB. For such a dataset, one would need a cluster with 100 servers, each with 128GB to 256GB of DRAM, to accommodate all the data in DRAM. On the other hand, such datasets could be stored easily in the flash memory of a rack-sized… CONTINUE READING

Citations

Publications citing this paper.
SHOWING 1-7 OF 7 CITATIONS

References

Publications referenced by this paper.
SHOWING 1-7 OF 7 REFERENCES

High performance RDMA-based design of HDFS over InfiniBand

Nusrat Sharmin Islam, Masudar Rahman, +5 authors Dhabaleswar K. Panda
  • 2012 International Conference for High Performance Computing, Networking, Storage and Analysis
  • 2012
VIEW 7 EXCERPTS
HIGHLY INFLUENTIAL

Refactored Design of I/O Architecture for Flash Storage

  • IEEE Computer Architecture Letters
  • 2015
VIEW 4 EXCERPTS
HIGHLY INFLUENTIAL

Active Disks

Anurag Acharya, Mustafa Uysal, Joel H. Saltz
  • 1998
VIEW 4 EXCERPTS
HIGHLY INFLUENTIAL