LDA*: A Robust and Large-scale Topic Modeling System

@article{Yu2017LDAAR,
  title={LDA*: A Robust and Large-scale Topic Modeling System},
  author={Lele Yu and Bin Cui and Ce Zhang and Yingxia Shao},
  journal={PVLDB},
  year={2017},
  volume={10},
  pages={1406-1417}
}
We present LDA∗, a system that has been deployed in one of the largest Internet companies to fulfil their requirements of “topic modeling as an internal service”—relying on thousands of machines, engineers in different sectors submit their data, some are as large as 1.8TB, to LDA∗ and get results back in hours. LDA∗ is motivated by the observation that none of the existing topic modeling systems is robust enough—Each of these existing systems is designed for a specific point in the tradeoff… CONTINUE READING

Similar Papers

Loading similar papers…