Hierarchical Mixtures of Experts and the EM Algorithm

@article{Jordan1994HierarchicalMO,
  title={Hierarchical Mixtures of Experts and the EM Algorithm},
  author={Michael I. Jordan and Robert A. Jacobs},
  journal={Neural Computation},
  year={1994},
  volume={6},
  pages={181-214}
}
We present a tree-structured architecture for supervised learning. The statistical model underlying the architecture is a hierarchical mixture model in which both the mixture coefficients and the mixture components are generalized linear models (GLIM's). Learning is treated as a maximum likelihood problem; in particular, we present an Expectation-Maximization (EM) algorithm for adjusting the parameters of the architecture. We also develop an on-line learning algorithm in which the parameters… CONTINUE READING
Highly Influential
This paper has highly influenced 230 other papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 4,010 citations. REVIEW CITATIONS
Recent Discussions
This paper has been referenced on Twitter 1 time over the past 90 days. VIEW TWEETS

Citations

Publications citing this paper.
Showing 1-10 of 1,652 extracted citations

4,011 Citations

0100200'93'98'04'10'16
Citations per Year
Semantic Scholar estimates that this publication has 4,011 citations based on the available data.

See our FAQ for additional information.

References

Publications referenced by this paper.
Showing 1-10 of 26 references

Convergence Properties of the EM Approach to Learning in Mixture-of-Experts Architectures

  • M. I. Jordan, L. Xu
  • Computational Cognitive Science Tech. Rep. 9301…
  • 1993
Highly Influential
5 Excerpts

Soft Competitive Adaptation: Neural Network Learning Algorithms

  • S. J. Nowlan
  • 1991
Highly Influential
6 Excerpts

Probabilistic interpretation of feedforward classification network outputs, with relationships to statistical pattern recognition

  • J.
  • Neurocomputing: Algorithms, Architectures, and…
  • 1989
Highly Influential
8 Excerpts

OCI: A Randomized Algorithm

  • S. K. Murthy, S. Kasif, S. Salzberg
  • 1993
1 Excerpt

Adaptive Filter Theory. Prentice-Hall, Englrwood Cliffs, NJ

  • Hall, S. London. Haykin
  • 1991

Similar Papers

Loading similar papers…