Characterizing Web user sessions

@article{Arlitt2000CharacterizingWU,
  title={Characterizing Web user sessions},
  author={Martin F. Arlitt},
  journal={SIGMETRICS Perform. Evaluation Rev.},
  year={2000},
  volume={28},
  pages={50--63}
}
  • M. Arlitt
  • Published 1 September 2000
  • Computer Science
  • SIGMETRICS Perform. Evaluation Rev.
This paper presents a detailed characterization of user sessions to the 1998 World Cup Web site. This study analyzes data that was collected from the World Cup site over a three-month period. During this time the site received 1.35 billion requests from 2.8 million distinct clients. This study focuses on numerous user session characteristics, including distributions for the number of requests per session, number of pages requested per session, session length, and inter-session times. This paper… 
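
The session metrics named in the abstract (requests per session, session length, inter-session time) can be illustrated with a minimal sketch. This is not the paper's method: the function names `sessionize` and `session_stats`, the 1800-second inactivity timeout, and the choice to measure inter-session time from the last request of one session to the first request of the next are all assumptions made for illustration.

```python
from collections import defaultdict


def sessionize(requests, timeout=1800):
    """Group (client_id, timestamp) pairs into per-client sessions.

    A new session starts when the gap between consecutive requests from
    the same client exceeds `timeout` seconds. The 1800 s default is an
    assumed value, not one taken from the paper.
    """
    by_client = defaultdict(list)
    for client, ts in requests:
        by_client[client].append(ts)

    sessions = {}
    for client, stamps in by_client.items():
        stamps.sort()
        current = [stamps[0]]
        client_sessions = [current]
        for ts in stamps[1:]:
            if ts - current[-1] > timeout:
                # Gap is too long: close the session and open a new one.
                current = [ts]
                client_sessions.append(current)
            else:
                current.append(ts)
        sessions[client] = client_sessions
    return sessions


def session_stats(client_sessions):
    """Return (requests per session, session lengths, inter-session times).

    Session length is last timestamp minus first timestamp. Inter-session
    time is assumed here to run from the end of one session to the start
    of the next.
    """
    requests_per_session = [len(s) for s in client_sessions]
    session_lengths = [s[-1] - s[0] for s in client_sessions]
    inter_session_times = [
        nxt[0] - prev[-1]
        for prev, nxt in zip(client_sessions, client_sessions[1:])
    ]
    return requests_per_session, session_lengths, inter_session_times
```

For example, `sessionize([("a", 0), ("a", 10), ("a", 5000)])` splits client `a` into two sessions because the 4990-second gap exceeds the assumed timeout. A different timeout would change every distribution derived from the sessions, so any real analysis would need to state and justify the threshold.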

Hierarchical Workload Characterization for a Busy Web Server

The behavioural characteristics that emerge from this study show different features at each level of the Web server access hierarchy and suggest effective strategies for managing resources at busy Internet Web servers.

Characterizing user sessions on YouTube

It is found that YouTube users transfer more data and have longer think times than users in traditional Web workloads, which has implications for network capacity planning and the design of next-generation synthetic Web workloads.

Discovering Web Workload Characteristics through Cluster Analysis

A clustering analysis of session-based Web workloads of eight Web servers is presented, using the intrasession characteristics (i.e., number of requests per session, session length in time, and bytes transferred per session) as variables.

Characterizing broadband user behavior

This paper presents a characterization of broadband user behavior from an Internet service provider and uncovers two main groups of session request patterns within each user category.

Workload characterization and customer interaction at e-commerce web servers

This thesis analyzes the Web access logs at public Web sites for three organizations: a car rental company, an IT company, and the Computer Science department of the University of Saskatchewan to identify session clusters and proposes a hybrid clustering algorithm.

Identifying User Behavior by Analyzing Web Server Access Log File

An in-depth analysis of the Web log data of a NASA website is used to find information about the site, its top errors, potential visitors, and so on. Using Web usage mining, this helps system administrators and Web designers improve their systems by identifying system errors and corrupted or broken links.

Analyzing user profiles in electronic markets

Presents a workload characterization and evaluates the navigation profiles of humans and bots in an electronic market, using a graph model to represent user navigation and identifying typical differences between their behaviors.

Session-Based Admission Control: A Mechanism for Peak Load Management of Commercial Web Sites

It is shown that a Web server augmented with the admission control mechanism is able to provide a fair guarantee of completion for any accepted session, independent of session length, which is a critical requirement for any e-business.

Performance vs. freshness in web database applications

This work introduces two new semantic-based data freshness metrics that capture content dependencies, proposes two materialization algorithms that balance QoS and QoD, and shows that this approach outperforms existing QoS-QoD balancing approaches in terms of server-side response time (throughput), data freshness, and scalability.

A characterization of broadband user behavior and their e-business activities

This paper presents a characterization of broadband user behavior from an Internet Service Provider standpoint, and finds that subscription-based and advertising services account for the vast majority of user HTTP requests in both residential and SOHO workloads.
...

References

A workload characterization study of the 1998 World Cup Web site

It is found that improvements in the caching architecture of the World Wide Web are changing the workloads of Web servers, but major improvements to that architecture are still necessary.

Workload Characterization of the 1998 World Cup Web Site

Analysis of the 1998 World Cup Web site finds that improvements in the caching architecture of the World-Wide Web are changing the workloads of Web servers, but that major improvements to that architecture are still necessary.

Internet Web servers: workload characterization and performance implications

The paper concludes with a discussion of caching and performance issues, using the observed workload characteristics to suggest performance enhancements that seem promising for Internet Web servers.

A methodology for workload characterization of E-commerce sites

This paper introduces a state transition graph called Customer Behavior Model Graph (CBMG), that is used to describe the behavior of groups of customers who exhibit similar navigational patterns, and proposes a clustering algorithm to characterize workloads of e-commerce sites in terms of CBMGs.

Addressing the challenges of web data transport

To avoid performance degradation, end-host and router-based techniques are developed that both reduce disruption in the feedback and reduce TCP's dependence on such feedback, and that help decrease download time by a factor of fifteen.

The case for persistent-connection HTTP

The results of log-driven simulations of several variants of the proposed modifications to HTTP demonstrate the value of persistent connections.

A Performance Evaluation of HyperText Transfer Protocols

  • Proceedings of ACM SIGMETRICS '99,
  • 1999

Resource management policies for e-commerce servers

A detailed simulation model was developed to assess the gain of adaptive policies with respect to policies that are oblivious to economic considerations and results show that the adaptive priority scheme suggested here can increase, during peak periods, business-oriented metrics such as revenue/sec by as much as 43% over the non priority case.

Berners-Lee, "RFC 2616 - Hypertext Transfer Protocol - HTTP/1.1"

  • 1999