Corpus ID: 16417084

Splash: User-friendly Programming Interface for Parallelizing Stochastic Algorithms

@article{Zhang2015SplashUP,
  title={Splash: User-friendly Programming Interface for Parallelizing Stochastic Algorithms},
  author={Yuchen Zhang and Michael I. Jordan},
  journal={ArXiv},
  year={2015},
  volume={abs/1506.07552}
}
  • Yuchen Zhang, Michael I. Jordan
  • Published in ArXiv 2015
  • Computer Science
  • Stochastic algorithms are efficient approaches to solving machine learning and optimization problems. In this paper, we propose a general framework called Splash for parallelizing stochastic algorithms on multi-node distributed systems. Splash consists of a programming interface and an execution engine. Using the programming interface, the user develops sequential stochastic algorithms without concerning any detail about distributed computing. The algorithm is then automatically parallelized by… CONTINUE READING

    Citations

    Publications citing this paper.
    SHOWING 1-10 OF 17 CITATIONS

    MLlib*: Fast Training of GLMs Using Spark MLlib

    VIEW 2 EXCERPTS
    CITES METHODS

    Parallel Machine Learning Using Concurrency Control

    VIEW 1 EXCERPT
    CITES METHODS

    Big data analytics on Apache Spark

    VIEW 1 EXCERPT
    CITES BACKGROUND

    Distributed Machine Learning with Communication Constraints

    • Yuchen Zhang
    • Computer Science
    • 2016
    VIEW 2 EXCERPTS

    High Performance Parallel Stochastic Gradient Descent in Shared Memory

    References

    Publications referenced by this paper.
    SHOWING 1-10 OF 59 REFERENCES

    Stochastic variational inference

    VIEW 12 EXCERPTS
    HIGHLY INFLUENTIAL

    Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing

    • M. Zaharia, M. Chowdhury, +6 authors I. Stoica
    • NSDI. USENIX Association,
    • 2012
    VIEW 11 EXCERPTS
    HIGHLY INFLUENTIAL

    Finding scientific topics.

    VIEW 9 EXCERPTS
    HIGHLY INFLUENTIAL

    Latent Dirichlet Allocation

    VIEW 5 EXCERPTS

    MLlib: Machine Learning in Apache Spark

    VIEW 1 EXCERPT

    Baseline methods: single-thread stochastic algorithm; MLlib (the official machine learning library for Spark). AMP Lab Splash

    • Baseline methods: single-thread stochastic algorithm; MLlib (the official machine learning library for Spark). AMP Lab Splash
    • 2015

    Machine Learning Package AMP Lab Splash

    • Machine Learning Package AMP Lab Splash
    • 2015

    Machine learning in apache spark

    • X Meng, J Bradley, +7 authors S Owen
    • Machine learning in apache spark
    • 2015