Tagging an Unfamiliar Text With Minimal Human Supervision

  title={Tagging an Unfamiliar Text With Minimal Human Supervision},
  author={Mitch Marcus},
In this paper, we will discuss a method for tagging an unannotated text corpus whose structure is completely unknown, with a little bit of help from an informant. Starting from scratch, automated and semi-automated methods are employed to build a part of speech tagger for the text. There are three steps to building the tagger: uncovering a set of part of speech tags, discovering for each word its most likely tag, and learning rules to both correct mistakes in the dictionary and discover where… CONTINUE READING
Highly Cited
This paper has 55 citations. REVIEW CITATIONS
36 Citations
13 References
Similar Papers


Publications citing this paper.
Showing 1-10 of 36 extracted citations

56 Citations

Citations per Year
Semantic Scholar estimates that this publication has 56 citations based on the available data.

See our FAQ for additional information.


Publications referenced by this paper.
Showing 1-10 of 13 references

A Distributional Analysis Approach to Language Learning

  • E. Brill
  • Dissertation Proposal, University of Pennsylvania…
  • 1992

A Practical Partof-Speech Tagger

  • D. Cutting et al 92 Cutting, J. Kupiec, J. Pederson, P. Sibun
  • In Proceedings of the Third Conference on Applied…
  • 1992
1 Excerpt

A Theory of Language and Information

  • Harris, Zellig
  • 1991

Class-Based ngram Models of Natural Language

  • Brown
  • In Proceedings of the IBM Natural Language ITL,
  • 1990

Similar Papers

Loading similar papers…