Building light-weight wrappers for legacy web data-sources using W4F
- Arnaud Sahuguet, Fabien Azavant
Organization of a web site is important to help users get the most out of the site. A good web site should help visitors find the information they want easily. Visitors typically find information by searching for selected terms of interest or by following links from one web page to another. The first approach is more useful if the visitor knows exactly what he is seeking, while the second approach is useful when the visitor has less of a preconceived notion about what he wants. The organization of a web site is especially important in the latter case. Traditionally, web site organization is done by hand. In this paper, we introduce the problem of automatic web site construction and propose a solution for solving a major step of the problem based on decision tree algorithms. The solution is found to be useful in automatic construction of product catalogs.