Towards a highly-scalable and effective metasearch engine

@inproceedings{Wu2001TowardsAH,
  title={Towards a highly-scalable and effective metasearch engine},
  author={Zonghuan Wu and Weiyi Meng and Clement T. Yu and Zhuogang Li},
  booktitle={WWW '01},
  year={2001}
}
A metasearch engine is a system that supports uni ed access to multiple local search engines. Database selection is one of the main challenges in building a large-scale metasearch engine. The problem is to eAEciently and accurately determine a small number of potentially useful local search engines to invoke for each user query. In order to enable accurate selection, metadata that re ect the contents of each search engine need to be collected and used. In this paper, we propose a highly… Expand
Building EÆcient and E ective Metasearch Engines
Frequently a user's information needs are stored in the databases of multiple search engines. It is inconvenient and ineÆcient for an ordinary user to invoke multiple search engines and identifyExpand
Building efficient and effective metasearch engines
TLDR
In this article, techniques that have been proposed to tackle several underlying challenges for building a good metasearch engine are surveyed. Expand
To Evaluate the Performance of Metasearch Engines : A Comparative Study
-The explosive growth of information source on the Web and in turn continuing technological progress of searching the information by using relevant tools like search engine poses many problems forExpand
A meta-search method reinforced by cluster descriptors
  • Y. Shen, Lee
  • Computer Science
  • Proceedings of the Second International Conference on Web Information Systems Engineering
  • 2001
TLDR
It is shown that cluster descriptors can provide a finer and more accurate representation of the document space, and hence enable the meta-search engine to improve the selection of relevant search engines. Expand
Advanced Metasearch Engine Technology
TLDR
The authors make a strong case for the viability of the large-scale metasearch engine technology as a competitive technology for Web search. Expand
QuadSearch : A Novel Metasearch Engine
Metasearch engines are increasingly becoming a very useful tool for Web information retrieval. In this paper we describe QuadSearch, an experimental metasearch engine that provides simultaneousExpand
Mining Web Graphs for Large Scale Meta Search Engine Results
TLDR
This paper is implementing the study on how to merge the search results returned from the multiple component search engines into a single ranked list through web graphs. Expand
AllInOneNews: development and evaluation of a large-scale news metasearch engine
TLDR
A novel scheme to compare multiple news search systems in a combined measure that takes both relevance and time-sensitivity of retrieved information into consideration is introduced. Expand
Towards automatic incorporation of search engines into a large-scale metasearch engine
TLDR
Automatic search engine discovery, automatic search engine connection, and automatic search engines result extraction techniques are proposed, and experiments indicate that these techniques are highly effective and efficient. Expand
Techniques for specialized search engines
TLDR
The issues in this area of specialized search engine creation are discussed and an overview of the techniques for building specialized search engines are given. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 37 REFERENCES
Building efficient and effective metasearch engines
TLDR
In this article, techniques that have been proposed to tackle several underlying challenges for building a good metasearch engine are surveyed. Expand
Efficient and effective metasearch for a large number of text databases
TLDR
This paper proposes to use a hierarchy of database representatives to improve the efficiency of metasearch engines, and provides an algorithm to search the hierarchy and shows that the retrieval effectiveness of the algorithm is the same as that of evaluating the user query against all database representatives. Expand
Experiences with selecting search engines using metasearch
TLDR
The efficacy of SavvySearch's incrementally acquired metaindex approach to selecting search engines is studied by analyzing the effect of time and experience on performance and how much experience is required to surpass the simple scheme. Expand
Server Ranking for Distributed Text Retrieval Systems on the Internet
TLDR
It is argued that delegating the task of meta-data collection to local index servers is a more scalable approach, and a mechanism for integrating distributed autonomous index servers into a cooperative resource discovery system is proposed. Expand
Detection of heterogeneities in a multiple text database environment
  • W. Meng, Clement T. Yu, King-Lup Liu
  • Computer Science
  • Proceedings Fourth IFCIS International Conference on Cooperative Information Systems. CoopIS 99 (Cat. No.PR00384)
  • 1999
TLDR
This work first analyzes the impact of various heterogeneities on building a metasearch engine, then presents some techniques that can be used to detect the most prominentheterogeneities among multiple search engines. Expand
Query routing for Web search engines: architecture and experiments
TLDR
Q-Pilot is described, an automatic query routing system that attempts to dynamically route each user query to the appropriate specialized search engines, based on an off-line component that creates an approximate model of each specialized search engine's topic. Expand
Methods for information server selection
TLDR
A novel method using Lightweight Probe queries (LWP method) is compared with several methods based on data from past query processing, while Random and Optimal server rankings serve as controls. Expand
Efficient and effective metasearch for text databases incorporating linkages among documents
TLDR
The importance (rank) of each document as determined by the linkages is integrated in each database representative to facilitate the selection of databases for each given query. Expand
Adaptive Agents for Information Gathering from Multiple, Distributed Information Sources
TLDR
An intelligent, adaptive Web search tool that can not only locate relevant information sources for the user, but also adapt to the frequent changes of the dynamic Web environment is described. Expand
Cluster-based language models for distributed retrieval
TLDR
A new approach to distributed retrieval based on document clustering and language modeling is proposed and it is shown that all three methods improve the effectiveness of distributed retrieval. Expand
...
1
2
3
4
...