• Publications
  • Influence
E-privacy in 2nd generation E-commerce: privacy preferences versus actual behavior
TLDR
An experiment in which self-reported privacy preferences of 171 participants were compared with their actual disclosing behavior during an online shopping episode, suggesting that current approaches to protect online users' privacy may face difficulties to do so effectively. Expand
A Framework for the Evaluation of Session Reconstruction Heuristics in Web-Usage Analysis
TLDR
A set of performance measures that are sensitive to two types of reconstruction errors and appropriate for different applications in knowledge discovery (KDD) applications are proposed that help the analyst in the selection of the heuristic best suited for the application at hand. Expand
Towards Semantic Web Mining
TLDR
An overview of where the two areas meet today, and ways of how a closer integration could be profitable are sketched. Expand
Privacy in e-commerce: stated preferences vs. actual behavior
TLDR
The possibility of a "transparent human," whose vital information is up for grabs, can most easily be envisioned in the realm of e-commerce, due in part to the large amounts of data available, and the high payoffs expected from using this data for marketing purposes. Expand
Analysis of navigation behaviour in web sites integrating multiple information systems
TLDR
This work proposes the web usage miner (WUM), which discovers navigation patterns subject to advanced statistical and structural constraints in web sites composed of conventional static pages and constructs conceptual hierarchies that reflect the query capabilities used in the production of those pages. Expand
Semantic Web Mining: State of the art and future directions
TLDR
It is argued that the two areas Web Mining and Semantic Web need each other to fulfill their goals, but that the full potential of this convergence is not yet realized. Expand
The Impact of Site Structure and User Environment on Session Reconstruction in Web Usage Analysis
TLDR
Different measures of session reconstruction quality, as well as experiments on the impact on the prediction of frequent entry and exit pages, show that different reconstruction heuristics can be recommended depending on the characteristics of the site. Expand
Measuring the Accuracy of Sessionizers for Web Usage Analysis
TLDR
A set of measures that compute the extent to which real sessions are successfully reconstructed by sessionizers are introduced, and a number of sessionizing tools are evaluated using the log data of a real web site. Expand
RobBERT: a Dutch RoBERTa-based Language Model
TLDR
It is found that RobBERT improves state-of-the-art results for various tasks, and especially significantly outperforms other models when dealing with smaller datasets, indicating that it is a powerful pre-trained model for a large variety of Dutch language tasks. Expand
A user-centric approach to identifying best deployment strategies for language tools: the impact of content and access language on Web user behaviour and attitudes
TLDR
The results indicate that non-English languages are under-represented on the Web and that this is partly due to content-creation, link-setting and link-following behaviour, and suggest that more cross-language tools are desirable. Expand
...
1
2
3
4
5
...