Saravadee Sae Tan

  • Citations Per Year
Learn More
Having broad coverage of search results returned by various search sources, combining and organizing these results in a meaningful way has become a common issue in the field of information retrieval. In this demo paper, we describe our meta search system, MICE, that is able to aggregate and classify search results based on user-customized categories.(More)
This paper addresses the problem of matching between highly heterogeneous structures. The problem is modeled as a classification task where training examples are used to learn the matching between structures. In our approach, training is performed using partially labeled data. We propose a Greedy Mapping approach to generate training examples from partially(More)
On the web, most structured document collections consist of documents from different sources and marked up with different types of structures. The diversity of structures has led to the emergence of heterogeneous structured documents. The heterogeneity of structured documents is one of the reason for query-document mismatch in structured document retrieval.(More)
Structured retrieval aims at exploiting the structural information of documents when searching for documents. Structured retrieval makes use of both content and structure of documents to improve information retrieval. Therefore, the availability of semantic structure in the documents is an important factor for the success of structured retrieval. However,(More)
In recent year, XML has become the major means for information representation and exchange on the Web. Due to the increasing number of XML documents, XML similarity becomes essential in a wide range of applications like information extraction, data integration etc. In this paper we present a clustering approach that calculate similarity between XML elements(More)
EXTENDED ABSTRACT The retrieval of structured resources using unstructured queries is challenging as we need to deal with the matching between entities of two different types. Consider an unstructured query, “publications of K.H. Gan in WI”, in a structured retrieval system. To match this query to structured resources, the system needs to transform it into(More)
In the domain of financial, financial news, articles, reports about financial reviews are helpful and important information which give investors or financial analyst an indication to help decision making in financial matters. However, due to the volume of this information and the diversity of financial topics, it is difficult for a human to track and(More)
Categories are used to organize information and knowledge in directory system, folder etc. As the amount of information increase and the types of information diversify, it is common to have more categories created. As the number of categories increases, it becomes more difficult to organize, manage and look up information from existing categories. In this(More)