Corpus ID: 6065049

Detecting Structural Similarities between XML Documents

@inproceedings{Flesca2002DetectingSS,
  title={Detecting Structural Similarities between XML Documents},
  author={S. Flesca and G. Manco and E. Masciari and L. Pontieri and Andrea Pugliese},
  booktitle={WebDB},
  year={2002}
}
In this paper we propose a technique for detecting the similarity in the structure of XML documents. The technique is based on the idea of representing the structure of an XML document as a time series in which each occurrence of a tag corresponds to a given impulse. By analyzing the frequencies of the corresponding Fourier transform, we can hence state the degree of similarity between documents. The efficiency and effectiveness of this approach are compelling when compared with traditional…
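
The idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the tag encoding (a positive integer per start tag, its negative per end tag), the zero-padding to a common length, and the function names `tag_events`, `dft_magnitudes`, and `structural_distance` are all assumptions made here for illustration. It uses a naive O(n²) discrete Fourier transform from the standard library only.

```python
import cmath
import io
import math
import xml.etree.ElementTree as ET


def tag_events(xml_text):
    """Return the document's tag sequence as (event, tag) pairs, ignoring text."""
    source = io.BytesIO(xml_text.encode("utf-8"))
    return [(event, elem.tag)
            for event, elem in ET.iterparse(source, events=("start", "end"))]


def dft_magnitudes(signal, n):
    """Magnitudes of a naive DFT of `signal`, zero-padded to length n."""
    padded = list(signal) + [0] * (n - len(signal))
    mags = []
    for k in range(n):
        coeff = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(padded))
        mags.append(abs(coeff))
    return mags


def structural_distance(xml_a, xml_b):
    """Euclidean distance between the magnitude spectra of two tag signals."""
    events_a, events_b = tag_events(xml_a), tag_events(xml_b)
    # Assumed encoding: one integer per tag name, shared by both documents.
    tags = sorted({tag for _, tag in events_a + events_b})
    code = {tag: i + 1 for i, tag in enumerate(tags)}

    def to_signal(events):
        # Start tags become positive impulses, end tags negative ones.
        return [code[t] if ev == "start" else -code[t] for ev, t in events]

    sig_a, sig_b = to_signal(events_a), to_signal(events_b)
    n = max(len(sig_a), len(sig_b))
    mags_a, mags_b = dft_magnitudes(sig_a, n), dft_magnitudes(sig_b, n)
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(mags_a, mags_b)))


if __name__ == "__main__":
    doc1 = "<a><b>x</b><b>y</b></a>"
    doc2 = "<a><b>p</b><b>q</b></a>"   # same structure, different text
    doc3 = "<a><c><d/></c></a>"        # different structure
    print(structural_distance(doc1, doc2))
    print(structural_distance(doc1, doc3))
```

Because only tag events are encoded, two documents that differ solely in their text content map to the same signal and get distance zero. Zero-padding is a simplification chosen here; other ways of aligning signals of different lengths would give different distances.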

Structural Classification of XML Documents Using Multisets
Clustering Schemaless XML Documents
Similarity Algorithm Based on Weighted Hierarchical Structure of XML Document
Finding Syntactic Similarities Between XML Documents
A Tree-Based Approach to Clustering XML Documents by Structure
A Progressive Clustering Algorithm to Group the XML Data by Structural and Semantic Similarity
Measuring the structural similarity among XML documents and DTDs

References

Detecting changes in XML documents
Araneus in the Era of XML
Efficient retrieval of similar time series
Query Optimization for XML
Efficient Similarity Search In Sequence Databases
Change detection in hierarchically structured information
Information integration using logical views
  • J. Ullman
  • Computer Science, Mathematics
  • Theor. Comput. Sci.
  • 2000