Analysis of large data logs: an application of Poisson sampling on excite web queries

  title={Analysis of large data logs: an application of Poisson sampling on excite web queries},
  author={Huseyin Cenk {\"O}zmutlu and Amanda Spink and Seda {\"O}zmutlu},
  journal={Inf. Process. Manage.},
Search engines are the gateway for users to retrieve information from the Web. There is a crucial need for tools that allow effective analysis of search engine queries to provide a greater understanding of Web users' information seeking behavior. The objective of the study is to develop an effective strategy for the selection of samples from large-scale data sets. Millions of queries are submitted to Web search engines daily and new sampling techniques are required to bring these databases to a… CONTINUE READING


Publications citing this paper.
Showing 1-10 of 21 extracted citations

Automatic New Topic Identification in Search Engine Transaction Logs  Using Multiple Linear Regression

Proceedings of the 41st Annual Hawaii International Conference on System Sciences (HICSS 2008) • 2008
View 1 Excerpt


Publications referenced by this paper.
Showing 1-8 of 8 references

Time-based web mining of search logs: implications for efficient operations

Ozmutlu et al, H. C. 2001. Ozmutlu, A. Spink, A. Hurson
In Proceedings of IC2001: International Conference on Internet Computing • 2001

Methods for statistical analysis of reliability and life

D. C. Montgomery
web. Nature • 1999

Use of query reformulation and relevance feedback by web users

B. J. Jansen A. Spink, H. C. Ozmultu
Internet Research : Electronic Networking Applications and Policy • 1999

An analysis of Internet search engines : assessment of over 200 search queries

J. G. Packer
Computers in Libraries

Similar Papers

Loading similar papers…