Discovering More Accurate Frequent Web Usage Patterns

Preprint English OPEN
Bayir, Murat Ali ; Toroslu, Ismail Hakki ; Cosar, Ahmet ; Fidan, Guven (2008)
  • Subject: Computer Science - Data Structures and Algorithms | Computer Science - Databases

Web usage mining is a type of web mining, which exploits data mining techniques to discover valuable information from navigation behavior of World Wide Web users. As in classical data mining, data preparation and pattern discovery are the main issues in web usage mining. The first phase of web usage mining is the data processing phase, which includes the session reconstruction operation from server logs. Session reconstruction success directly affects the quality of the frequent patterns discovered in the next phase. In reactive web usage mining techniques, the source data is web server logs and the topology of the web pages served by the web server domain. Other kinds of information collected during the interactive browsing of web site by user, such as cookies or web logs containing similar information, are not used. The next phase of web usage mining is discovering frequent user navigation patterns. In this phase, pattern discovery methods are applied on the reconstructed sessions obtained in the first phase in order to discover frequent user patterns. In this paper, we propose a frequent web usage pattern discovery method that can be applied after session reconstruction phase. In order to compare accuracy performance of session reconstruction phase and pattern discovery phase, we have used an agent simulator, which models behavior of web users and generates web user navigation as well as the log data kept by the web server.
  • References (25)
    25 references, page 1 of 3

    [1] Y. M. A. Nanopoulos, D. Katsaros. Effective prediction of web-user accesses: A data mining approach. In WEBKDD, 2001.

    [2] R. Agrawal and R. Srikant. Fast algorithms for mining association rules in large databases. In VLDB, pages 487-499, 1994.

    [3] R. Agrawal and R. Srikant. Mining sequential patterns. In ICDE, pages 3-14, 1995.

    [4] M. A. Bayir. A new reactive method for processing web usage data. Master's thesis, Middle East Technical University, 2006.

    [5] M. A. Bayir, I. H. Toroslu, and A. Cosar. A new approach for reactive web usage data processing. In ICDE Workshops, page 44, 2006.

    [6] R. Cooley, B. Mobasher, and J. Srivastava. Web mining: Information and pattern discovery on the world wide web. In ICTAI, pages 558-567, 1997.

    [7] R. Cooley, B. Mobasher, and J. Srivastava. Data preparation for mining world wide web browsing patterns. Knowl. Inf. Syst., 1(1):5-32, 1999.

    [8] R. Cooley, P.-N. Tan, and J. Srivastava. Discovery of interesting usage patterns from web data. In WEBKDD, pages 163-182, 1999.

    [9] E. Frias-Martinez and V. Karamcheti. A customizable behavior model for temporal prediction of web user sequences. In WEBKDD, pages 66-85, 2002.

    [10] Y. Fu and M.-Y. Shih. A framework for personal web usage mining. In International Conference on Internet Computing, pages 595-600, 2002.

  • Metrics
    No metrics available
Share - Bookmark