UJM at INEX 2008 XML Mining Track

Mathias Géry, Christine Largeron, Christophe Moulin
This paper reports our experiments for the INEX XML Mining track, which consists of developing categorization (or classification) and clustering methods for XML documents. We represent XML documents as vectors of indexed terms. For our first participation, the purpose of our experiments is twofold: firstly, our overall aim is to set up a text-only categorization approach that can serve as a baseline for further work that will take into account the structure of the XML documents…
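
The representation of XML documents as vectors of indexed terms can be illustrated with a minimal sketch. It is not the authors' implementation: the raw term-frequency weighting, the letters-only tokenizer, the helper names (extract_text, tokenize, build_vocabulary, to_vector) and the two sample documents are all assumptions made for illustration. In keeping with a text-only baseline, the markup is discarded and only the text content is indexed.

```python
import re
import xml.etree.ElementTree as ET
from collections import Counter


def extract_text(xml_string):
    """Return the text content of an XML document, ignoring all markup."""
    root = ET.fromstring(xml_string)
    return " ".join(root.itertext())


def tokenize(text):
    """Lowercase the text and split it into alphabetic terms (assumed scheme)."""
    return re.findall(r"[a-z]+", text.lower())


def build_vocabulary(token_lists):
    """Collect the sorted set of distinct terms over the whole collection."""
    return sorted({tok for toks in token_lists for tok in toks})


def to_vector(tokens, vocabulary):
    """Map one document to a vector of raw term frequencies (assumed weighting)."""
    counts = Counter(tokens)
    return [counts[term] for term in vocabulary]


# Hypothetical two-document collection, for illustration only.
docs = [
    "<article><title>XML mining</title><body>Mining XML documents</body></article>",
    "<article><title>Text clustering</title><body>Clustering text documents</body></article>",
]

token_lists = [tokenize(extract_text(d)) for d in docs]
vocabulary = build_vocabulary(token_lists)
vectors = [to_vector(toks, vocabulary) for toks in token_lists]

print(vocabulary)
for vec in vectors:
    print(vec)
```

Vectors of this kind can be passed to any standard classifier or clustering algorithm. A structure-aware variant would have to keep some of the information that extract_text throws away here.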


