Evaluating Variable-Length Markov Chain Models for Analysis of User Web Navigation Sessions

Publication: Article, Preprint, 01 Apr 2007

Embargo end date: 01 Jan 2006

Publisher: Institute of Electrical and Electronics Engineers (IEEE)

Journal: IEEE Transactions on Knowledge and Data Engineering, volume 19, pages 441-452 (ISSN: 1041-4347)

Funded by: FCT | site-o-matic: Web site au...

Authors: José Luís Cabral de Moura Borges; Mark Levene

doi: 10.1109/tkde.2007.1012, 10.48550/arxiv.cs/0606115

arXiv: cs/0606115

Abstract

Markov models have been widely used to represent and analyse user web navigation data. In previous work we proposed a method to dynamically extend the order of a Markov chain model and a complementary method for assessing the predictive power of such a variable-length Markov chain. Herein, we review these two methods and propose a novel method for measuring the ability of a variable-length Markov model to summarise user web navigation sessions up to a given length. The summarisation ability of a model is important for identifying user navigation patterns, while the ability to make predictions is important for foreseeing the next link a user will choose after following a given trail, for example in order to personalise a web site. We present an extensive experimental evaluation providing strong evidence that prediction accuracy increases linearly with summarisation ability.
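To make the idea of predicting the next link after a given trail concrete, here is a minimal sketch of a fixed-order Markov model built from navigation sessions. It is not the authors' variable-length method, which extends the order dynamically; it only illustrates why a longer context can sharpen a prediction. The function names `build_model` and `predict_next` and the sample sessions are hypothetical, introduced purely for illustration.

```python
from collections import Counter, defaultdict


def build_model(sessions, order=1):
    """Count how often each page follows each context of `order` pages (order >= 1)."""
    counts = defaultdict(Counter)
    for session in sessions:
        for i in range(order, len(session)):
            context = tuple(session[i - order:i])
            counts[context][session[i]] += 1
    return counts


def predict_next(counts, trail, order=1):
    """Return (page, estimated probability) for the most likely next page, or None."""
    context = tuple(trail[-order:])
    followers = counts.get(context)
    if not followers:
        return None  # context never observed in the training sessions
    page, n = followers.most_common(1)[0]
    return page, n / sum(followers.values())


# Hypothetical sessions: the page visited before "products" determines what follows it.
sessions = [
    ["home", "products", "cart"],
    ["home", "products", "cart"],
    ["blog", "products", "reviews"],
    ["blog", "products", "reviews"],
]

first = build_model(sessions, order=1)
second = build_model(sessions, order=2)

# First order sees only "products", so the two continuations are equally likely.
print(predict_next(first, ["home", "products"], order=1))
# Second order also sees the preceding page, which removes the ambiguity.
print(predict_next(second, ["home", "products"], order=2))
print(predict_next(second, ["blog", "products"], order=2))
```

In this toy data the first-order model cannot tell the two trails apart, whereas the second-order model can. A variable-length model aims for that benefit only where the data supports it, rather than raising the order everywhere.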

Related Organizations

- University of Porto, Portugal
- Knowledge Systems Institute, United States
- School of Engineering, Japan
- Universidade Lusófona do Porto, Portugal
- School of Engineering, Switzerland


Keywords

FOS: Computer and information sciences, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Information Retrieval (cs.IR), Computer Science - Information Retrieval

Impact by BIP!

- Selected citations: 79. These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- Popularity: Top 10%. This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
- Influence: Top 1%. This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
- Impulse: Top 10%. This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.