Separating the swarm: categorization methods for user sessions on the web

@article{Heer2002SeparatingTS,
  title={Separating the swarm: categorization methods for user sessions on the web},
  author={J. Heer and Ed H. Chi},
  journal={Proceedings of the SIGCHI Conference on Human Factors in Computing Systems},
  year={2002}
}
  • J. Heer, Ed H. Chi
  • Published 2002
  • Computer Science
  • Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Understanding user behaviors on Web sites enables site owners to make sites more usable, ultimately helping users to achieve their goals more quickly. Accordingly, researchers have devised methods for categorizing user sessions in hopes of revealing user interests. These techniques build user profiles by combining users' navigation paths with other data features, such as page viewing time, hyperlink structure, and page content. Previously, we have presented complex techniques of combining many… Expand
An Unsupervised Method for Web User Interest Analysis
TLDR
An unsupervised method to determine dominating user interests from browsing behavior data by partitioning a similarity graph, in which nodes are users and edges represent the similarities in browsing behavior between the users. Expand
Mining the Structure of User Activity using Cluster Stability
TLDR
The utility of a stability-based technique for automatically determining the optimal number of clusters in the context of web user session clustering is examined. Expand
Clustering Web Sessions Using Extended General Pages
TLDR
This work presents two extensions of a hierarchical clustering algorithm, ROCK, and proposes a set of measurements for assessing the results from clustering boolean and categorical data and helps users to identify their desired clustering results. Expand
Examining users on news provider web sites: a review of methodology
This project implemented and reviewed several methods to collect data about users' information seeking behavior on news provider Web sites. While browsing news sites, participants exhibited aExpand
Unsupervised Clickstream Clustering for User Behavior Analysis
TLDR
An unsupervised system to capture dominating user behaviors from clickstream data, and visualize the detected behaviors in an intuitive manner, which effectively identifies previously unknown behaviors, e.g., dormant users, hostile chatters. Expand
LumberJack: Intelligent Discovery and Analysis of Web User Traffic Composition
TLDR
This paper describes the further development of this work into a prototype service called LumberJack, a push-button analysis system that is both more automated and accurate than past systems. Expand
A User-Centered Approach to Evaluating Topic Models
TLDR
Results show that agglomerative techniques – specifically, average-link clustering – provide the most effective methodology for building topic models while ignoring topic evidence and implicit evidence. Expand
Recommender Systems and Collaborative Information Foraging
In order to help people cope with the voluminous amount of information in an online environment, such as the World Wide Web, recent researches in HCI have focused on creating the intelligent systemsExpand
Multimodal Usage Visualization for Large Websites
Large websites pose the following challenges for comprehension of user behavior: users’ behaviors are complex and diverse, the web log data is very noisy, and the quantity of the web log data is of aExpand
An Approach to Extract Informative Rules for Web Page Recommendation by Genetic Programming
TLDR
A method which extracts more informative rules from clickstreams for web page recommendation based on genetic programming and association rules is proposed and verified that the proposed method is more e-ective than theother methods in various conditions. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 22 REFERENCES
Identification of Web User Traffic Composition using Multi-Modal Clustering and Information Scent
TLDR
This paper introduces and describes a method to discover major types of information goals of Web surfers automatically using multi-modal vectors that encompass various sources of information, including Content, Topology, and URL. Expand
Knowledge discovery from users Web-page navigation
  • C. Shahabi, A. Zarkesh, J. Adibi, Vishal Shah
  • Computer Science
  • Proceedings Seventh International Workshop on Research Issues in Data Engineering. High Performance Database Management for Large-Scale Applications
  • 1997
TLDR
A novel path clustering method based on the similarity of the history of user navigation which is capable of capturing the interests of the user which could persist through several subsequent hypertext link selections is introduced. Expand
From User Access Patterns to Dynamic Hypertext Linking
TLDR
The overall design of a system that implements the preprocessing, clustering, and dynamic link suggestion tasks is described, and some experimental results generated by analyzing the access log of a web site. Expand
A Generalization-Based Approach to Clustering of Web Usage Sessions
TLDR
The clustering of Web usage sessions based on the access patterns is studied and it is shown that the approach is efficient and practical for Web mining applications. Expand
Improving the Effectiveness of a Web Site with Web Usage Mining
TLDR
This study proposes a methodology of assessing the quality of a web site in turning its users into customers and proposes a technique for dynamically adapting the site according to those rules. Expand
Clickstream clustering using weighted longest common subsequences
TLDR
A novel and eff ective algorithm for clustering webusers based on a function of the longest common subsequence of their clickst reams that takes into account both the trajectory taken through a website and the time spent at each page. Expand
A Cube Model for Web Access Sessions and Cluster Analysis
TLDR
A cube model to represent Web access sessions for data mining is presented and a new approach to using the first-order Markov transition frequency (or probability) matrix to analyze clustering results for categorical sequences is presented. Expand
Distributions of surfers' paths through the World Wide Web: Empirical characterizations
TLDR
Information‐theoretic measurements suggest that the conditional probabilities of link choice given surf path are more stable over time for shorter paths than longer paths, and direct examination of the accuracy of the conditional probability models in predicting test data suggests that shorter paths yield more stable models and can be estimated reliably with less data than longerpath. Expand
A Cube Model and Cluster Analysis for Web Access Sessions
TLDR
A cube model to represent Web access sessions for data mining, where irregular sessions are converted to a regular data structure to which existing data mining algorithms can be applied while the order of the page sequences is maintained. Expand
How to personalize the Web
TLDR
Web Browser Intelligence (WBI, pronounced “WEB-ee”) is an implemented system that organizes agents on a user’s workstation to observe user actions, proactively offer assistance, modify web documents, and perform new functions. Expand
...
1
2
3
...