Unsupervised induction of stochastic context-free grammars using distributional clustering

  title={Unsupervised induction of stochastic context-free grammars using distributional clustering},
  author={Alexander Clark},
An algorithmis presentedfor learninga phrase-structuregrammarfrom tagged text. It clusterssequencesof tagstogetherbasedon local distributional information,andselectsclustersthatsatisfy a novel mutual information criterion. This criterion is shown to be related to the entropy of a randomvariableassociatedwith thetreestructures, andit is demonstratedthatit selectslinguistically plausibleconstituents.This is incorporatedin a Minimum Description Lengthalgorithm. The evaluation of unsupervisedmodels… CONTINUE READING
Highly Influential
This paper has highly influenced 13 other papers. REVIEW HIGHLY INFLUENTIAL CITATIONS
Highly Cited
This paper has 114 citations. REVIEW CITATIONS


Publications citing this paper.

114 Citations

Citations per Year
Semantic Scholar estimates that this publication has 114 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-10 of 14 references

Inducingsyntacticcategories by context distribution clustering

  • AlexanderClark.
  • Proceedings of CoNLL-2000and LLL-2000, pages91–94…
  • 2000

Towards high speedgrammarinduction on large text corpora

  • PieterAdriaans, Marten Trautwein, Marco Vervoort
  • 2000

AlgorithmsonStrings , Treesand Sequences : ComputerScienceand Computational Biology . CambridgeUniversityPress . Zellig Harris . 1954 . Distributionalstructure

  • Sydney M. Lamb
  • 1997

BayesianLearningof Probabilistic Language Models

  • AndreasStolcke.
  • Ph.D.thesis,Dept. of…
  • 1994

Elementsof Information Theory

  • ThomasM. Cover, Joy A. Thomas.
  • Wiley Seriesin Telecommunications.
  • 1991

Similar Papers

Loading similar papers…