Chimera: Large-Scale Classification using Machine Learning, Rules, and Crowdsourcing

Chong Sun, Narasimhan Rampalli, Frank Yang, AnHai Doan
Large-scale classification is an increasingly critical Big Data problem. So far, however, very little has been published on how it is done in practice. In this paper we describe Chimera, our solution for classifying tens of millions of products into 5000+ product types at WalmartLabs. We show that at this scale, many conventional assumptions regarding learning and crowdsourcing break down, and that existing solutions cease to work. We describe how Chimera employs a combination of learning, rules…


Semantic Scholar estimates that this publication has 58 citations based on the available data.
