Corpus ID: 16583343

Jubatus: An Open Source Platform for Distributed Online Machine Learning

@inproceedings{Hido2013JubatusAO,
  title={Jubatus: An Open Source Platform for Distributed Online Machine Learning},
  author={S. Hido and Seiya Tokui and Satoshi Oda},
  year={2013}
}
Distributed computing is essential for handling very large datasets. Online learning is also promising for learning from rapid data streams. However, it is still an unresolved problem how to combine them for scalable learning and prediction on big data streams. We propose a general computational framework called loose model sharing for online and distributed machine learning. The key is to share only models rather than data between distributed servers. We also introduce Jubatus, an open source… CONTINUE READING
28 Citations

Tables from this paper

A Survey on Ensemble Learning for Data Stream Classification
  • 187
PySAD: A Streaming Anomaly Detection Framework in Python
  • PDF
The next evolution of MDE: a seamless integration of machine learning into domain modeling
  • 13
  • PDF
Scalable Machine Learning through Approximation and Distributed Computing
  • PDF
Gossip-Based Behavioral Group Identification in Decentralized OSNs
  • 1
  • PDF
Data Science and Big Data Computing
  • Z. Mahmood
  • Computer Science
  • Springer International Publishing
  • 2016
  • 4
  • Highly Influenced
Recent Advancements in Event Processing
  • 52
  • PDF

References

SHOWING 1-10 OF 29 REFERENCES
Gossip learning with linear models on fully distributed data
  • 70
  • PDF
GraphLab: A New Framework For Parallel Machine Learning
  • 781
  • PDF
MapReduce/Bigtable for Distributed Optimization
  • 96
  • PDF
PSVM : Parallelizing Support Vector Machines on Distributed Computers
  • 143
  • PDF
Mahout in Action
  • 391
Online Passive-Aggressive Algorithms
  • 1,668
  • PDF
Efficient Large-Scale Distributed Training of Conditional Maximum Entropy Models
  • 273
  • PDF
Learning via Gaussian Herding
  • 55
  • PDF
Adaptive regularization of weight vectors
  • 330
  • PDF
A reliable effective terascale linear learning system
  • 337
  • PDF