Petuum: A New Platform for Distributed Machine Learning on Big Data

@article{Xing2015PetuumAN,
  title={Petuum: A New Platform for Distributed Machine Learning on Big Data},
  author={E. Xing and Q. Ho and Wei Dai and J. Kim and Jinliang Wei and S. Lee and X. Zheng and Pengtao Xie and Abhimanu Kumar and Y. Yu},
  journal={IEEE Trans. Big Data},
  year={2015},
  volume={1},
  pages={49-67}
}
  • E. Xing, Q. Ho, +7 authors Y. Yu
  • Published 2015
  • Computer Science
  • IEEE Trans. Big Data
  • What is a systematic way to efficiently apply a wide spectrum of advanced ML programs to industrial scale problems, using Big Models (up to 100 s of billions of parameters) on Big Data (up to terabytes or petabytes)? Modern parallelization strategies employ fine-grained operations and scheduling beyond the classic bulk-synchronous processing paradigm popularized by MapReduce, or even specialized graph-based execution that relies on graph representations of ML programs. The variety of approaches… CONTINUE READING
    187 Citations
    Petuum: A New Platform for Distributed Machine Learning on Big Data
    • 205
    • PDF
    Strategies and Principles of Distributed Machine Learning on Big Data
    • 67
    • PDF
    Angel: a new large-scale machine learning system
    • 40
    • Highly Influenced
    • PDF
    Benchmarking Harp-DAAL: High Performance Hadoop on KNL Clusters
    • Langshi Chen, Bo Peng, +10 authors J. Qiu
    • Computer Science
    • 2017 IEEE 10th International Conference on Cloud Computing (CLOUD)
    • 2017
    • 18
    • PDF
    BLAS-on-flash : An Alternative for Large Scale ML Training and Inference ?
    • Suhas Jayaram Subramanaya
    • 2018
    • PDF
    Dolphin : Runtime Optimization for Distributed Machine Learning
    • 6
    • PDF
    Parallel Processing Systems for Big Data: A Survey
    • 59
    • Highly Influenced
    • PDF

    References

    SHOWING 1-10 OF 37 REFERENCES
    Distributed GraphLab: A Framework for Machine Learning in the Cloud
    • 572
    • Highly Influential
    • PDF
    Spark: Cluster Computing with Working Sets
    • 4,369
    • Highly Influential
    • PDF
    Piccolo: Building Fast, Distributed Programs with Partitioned Tables
    • 282
    • PDF
    Scaling Distributed Machine Learning with the Parameter Server
    • 1,048
    • PDF
    Pregel: a system for large-scale graph processing
    • 3,319
    • PDF
    Hadoop: The Definitive Guide
    • 3,882
    • PDF
    LightLDA: Big Topic Models on Modest Computer Clusters
    • 148
    • PDF
    Large Scale Distributed Deep Networks
    • 2,547
    • PDF