Searching the Web: general and scientific information access

@article{Lawrence1999SearchingTW,
  title={Searching the Web: general and scientific information access},
  author={Steve Lawrence and C. Lee Giles},
  journal={First IEEE/POPOV Workshop on Internet Technologies and Services. Proceedings (Cat. No.99EX391)},
  year={1999},
  pages={18--31}
}
  • S. Lawrence, C. Lee Giles
  • Published 1999
  • Computer Science
  • First IEEE/POPOV Workshop on Internet Technologies and Services. Proceedings (Cat. No.99EX391)
The World Wide Web has revolutionized the way that people access information, and has opened up new possibilities in areas such as digital libraries, general and scientific information dissemination and retrieval, education, commerce, entertainment, government and health care. There are many avenues for improvement of the Web, for example in the areas of locating and organizing information. Current techniques for access to both general and scientific information on the Web provide much room for…
Information retrieval on the web
TLDR
Overall trends cited by the sources are consistent and point to exponential growth in the past and in the coming decade, and the development of new techniques targeted to resolve some of the problems associated with Web-based information retrieval is discussed.
New Methods and Tools for the World Wide Web Search
TLDR
An analysis of key aspects of recently developed Web search methods and tools is presented: visual representation of subject trees, interactive user interfaces, linguistic approaches, image search, ranking and grouping of search results, database search, and scientific information retrieval.
CONTENT AND LINK STRUCTURE ANALYSIS FOR SEARCHING THE WEB
TLDR
An ideal search algorithm should find all of the relevant pages, rank them by relevance to the user query, and present a rank-ordered result to the users.
A Comparison on Intelligent Web Information Retrieval Systems
TLDR
This research aims to identify techniques that can improve the effectiveness of information retrieval, which requires new advanced tools that better cover the various phases of the information streams and can cope with the severe limitations of existing tools for information retrieval on the web.
Information retrieval on the web
  • Kiduk Yang
  • Computer Science
  • Annu. Rev. Inf. Sci. Technol.
  • 2005
TLDR
Researchers in Web IR have reexamined the findings from traditional IR research to discover which conventional evaluation measures may no longer be appropriate for Web IR, where a representative test collection is all but impossible to construct.
Hierarchical structural approach to improving the browsability of Web search engine results
  • H. Cui, Osmar R Zaiane
  • Computer Science
  • 12th International Workshop on Database and Expert Systems Applications
  • 2001
TLDR
An agent system based on a hierarchical structural approach for organizing Web search results, coupled with a metasearch approach for Web searching, and an ontological approach aimed at providing a mechanism to categorize search results in a semantic hierarchical organization are designed.
Classification-based Retrieval Methods to Enhance Information Discovery on the Web
TLDR
Log analyses are shown to be reasonable and informative, and can be used to detect changing trends and patterns in the query stream, thus providing valuable data to a search service, as well as providing techniques and metrics for performing temporal analysis on query logs.
Custom interfaces for advanced queries in search engines
TLDR
It is demonstrated that the gap between the provision of advanced search facilities and their use can be bridged, for specific information needs, by the construction of a simple interface in the form of a website that automatically formulates the necessary requests.
Text Retrieval Systems for the Web
TLDR
The focus of this paper is to survey the modern approaches to the accomplishment of different Web search engine tasks, as well as modifications of the retrieval problem associated with heterogeneity of both users' needs and architectures of the search systems used.
Chapter 10 World Wide Web Search Engines
Copyright © 2003, Idea Group Publishing. ABSTRACT The World Wide Web now holds more than 800 million pages covering almost all daily issues. The Web’s fast growing size and lack of structural style…

References

SHOWING 1-10 OF 26 REFERENCES
The Anatomy of a Large-Scale Hypertextual Web Search Engine
TLDR
This paper provides an in-depth description of Google, a prototype of a large-scale search engine which makes heavy use of the structure present in hypertext and looks at the problem of how to effectively deal with uncontrolled hypertext collections where anyone can publish anything they want.
Searching the world wide Web
TLDR
The coverage and recency of the major World Wide Web search engines were analyzed, yielding some surprising results, including a lower bound on the size of the indexable Web of 320 million pages.
Multi-Engine Search and Comparison Using the MetaCrawler
TLDR
The MetaCrawler provides a single, central interface for Web document searching that facilitates customization, privacy, sophisticated filtering of references, and more and serves as a tool for comparison of diverse search services.
Multi-Service Search and Comparison Using the MetaCrawler
Standard Web search services, though useful, are far from ideal. There are over a dozen different search services currently in existence, each with a unique interface and a database covering a di…
CiteSeer: an automatic citation indexing system
TLDR
CiteSeer has many advantages over traditional citation indexes, including the ability to create more up-to-date databases which are not limited to a preselected set of journals or restricted by journal publication delays, completely autonomous operation with a corresponding reduction in cost, and powerful interactive browsing of the literature using the context of citations.
A Machine Learning Architecture for Optimizing Web Search Engines
TLDR
A wide range of heuristics for adjusting document rankings based on the special HTML structure of Web documents are described, including a novel one inspired by reinforcement learning techniques for propagating rewards through a graph which can be used to improve a search engine's rankings.
Context and Page Analysis for Improved Web Search
TLDR
The paper discusses the features of the NECI metasearch engine and suggests ways to improve the efficiency of Web searches by downloading and analyzing each document and then displaying results that show the query terms in context.
The AltaVista Revolution: How to Find Anything on the Internet
TLDR
This book explains how to use the AltaVista service, an Internet search tool that captures the full and complete text of over 2.5 million web pages per day, without exercising any filters that might eliminate important content.
A Universal Citation Database
A universal, Internet-based, bibliographic and citation database would link every scholarly work ever written - no matter how published - to every work that it cites and every work that cites it.
A softbot-based interface to the Internet
TLDR
The Internet Softbot (software robot) is a fully implemented AI agent developed at the University of Washington that uses a UNIX shell and the World-Wide Web to interact with a wide range of internet resources.