Classifying XML Documents by Using Genre Features

Malcolm Clark and Stuart N. K. Watt
18th International Workshop on Database and Expert Systems Applications (DEXA 2007)
The categorization of documents is traditionally topic-based. This paper presents a complementary analysis of research and experiments on genre to show that encouraging results can be obtained by using genre structure (form) features. We conducted an experiment to assess the effectiveness of using Extensible Markup Language (XML) tag information, and part…
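
As a rough illustration of what "XML tag information" could look like as a structural (form) feature, the sketch below turns a document's element tags into relative frequencies. The abstract does not say which features or classifier the authors used, so the function name `tag_features`, the sample document, and the choice of relative frequencies are all assumptions made for this example, not the paper's method.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def tag_features(xml_text):
    """Return the relative frequency of each element tag in an XML document.

    Hypothetical helper for illustration only; the paper's actual
    feature set is not given in the abstract.
    """
    root = ET.fromstring(xml_text)
    # root.iter() walks the root element and all of its descendants.
    counts = Counter(elem.tag for elem in root.iter())
    total = sum(counts.values())
    return {tag: n / total for tag, n in counts.items()}


# Made-up sample document: 5 elements, 2 of which are <p>.
sample = "<article><title>T</title><sec><p>a</p><p>b</p></sec></article>"
features = tag_features(sample)
```

A vector like `features` depends only on the document's markup, not on its words, so it could be fed to any standard classifier alongside, or instead of, topic-based term features.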