A Scalable Asynchronous Distributed Algorithm for Topic Modeling

  title={A Scalable Asynchronous Distributed Algorithm for Topic Modeling},
  author={Hsiang-Fu Yu and Cho-Jui Hsieh and Hyokun Yun and S. V. N. Vishwanathan and Inderjit S. Dhillon},
Learning meaningful topic models with massive document collections which contain millions of documents and billions of tokens is challenging because of two reasons. First, one needs to deal with a large number of topics (typically on the order of thousands). Second, one needs a scalable and efficient way of distributing the computation across multiple machines. In this paper, we present a novel algorithm F+Nomad LDA which simultaneously tackles both these problems. In order to handle large… CONTINUE READING
Highly Cited
This paper has 39 citations. REVIEW CITATIONS
Related Discussions
This paper has been referenced on Twitter 7 times. VIEW TWEETS


Publications citing this paper.
Showing 1-10 of 30 extracted citations


Publications referenced by this paper.
Showing 1-7 of 7 references

NOMAD: Non-locking

H. Yun, H.-F. Yu, C.-J. Hsieh, S.V.N. Vishwanathan, I. S. Dhillon
stOchastic Multi-machine algorithm for Asynchronous and Decentralized matrix completion. Proceedings of the VLDB Endowment, 7(11):975–986 • 2014
View 5 Excerpts
Highly Influenced

Distributed Algorithms for Topic Models

Journal of Machine Learning Research • 2009
View 6 Excerpts
Highly Influenced

Similar Papers

Loading similar papers…