Association and Sequence Mining in Web Usage

Web servers worldwide generate a vast amount of information on web users’ browsing activities. Several researchers have studied these so-called clickstream or web access log data to better understand and characterize web users. Clickstream data can be enriched with information about the content of v...

Full description

Bibliographic Details
Main Author: Claudia Elena DINUCA
Format: Article
Language:English
Published: Dunarea de Jos University of Galati 2011-06-01
Series:Annals of Dunarea de Jos University. Fascicle I : Economics and Applied Informatics
Subjects:
Online Access:http://www.ann.ugal.ro/eco/Doc2011_2/ClaudiaDinuca.pdf
_version_ 1818152061356736512
author Claudia Elena DINUCA
author_facet Claudia Elena DINUCA
author_sort Claudia Elena DINUCA
collection DOAJ
description Web servers worldwide generate a vast amount of information on web users’ browsing activities. Several researchers have studied these so-called clickstream or web access log data to better understand and characterize web users. Clickstream data can be enriched with information about the content of visited pages and the origin (e.g., geographic, organizational) of the requests. The goal of this project is to analyse user behaviour by mining enriched web access log data. With the continued growth and proliferation of e-commerce, Web services, and Web-based information systems, the volumes of click stream and user data collected by Web-based organizations in their daily operations has reached astronomical proportions. This information can be exploited in various ways, such as enhancing the effectiveness of websites or developing directed web marketing campaigns. The discovered patterns are usually represented as collections of pages, objects, or re-sources that are frequently accessed by groups of users with common needs or interests. The focus of this paper is to provide an overview how to use frequent pattern techniques for discovering different types of patterns in a Web log database. In this paper we will focus on finding association as a data mining technique to extract potentially useful knowledge from web usage data. I implemented in Java, using NetBeans IDE, a program for identification of pages’ association from sessions. For exemplification, we used the log files from a commercial web site.
first_indexed 2024-12-11T13:48:44Z
format Article
id doaj.art-e42bcf10ac6d4b26a224040477ff09d6
institution Directory Open Access Journal
issn 1584-0409
language English
last_indexed 2024-12-11T13:48:44Z
publishDate 2011-06-01
publisher Dunarea de Jos University of Galati
record_format Article
series Annals of Dunarea de Jos University. Fascicle I : Economics and Applied Informatics
spelling doaj.art-e42bcf10ac6d4b26a224040477ff09d62022-12-22T01:04:23ZengDunarea de Jos University of GalatiAnnals of Dunarea de Jos University. Fascicle I : Economics and Applied Informatics1584-04092011-06-01123136Association and Sequence Mining in Web UsageClaudia Elena DINUCAWeb servers worldwide generate a vast amount of information on web users’ browsing activities. Several researchers have studied these so-called clickstream or web access log data to better understand and characterize web users. Clickstream data can be enriched with information about the content of visited pages and the origin (e.g., geographic, organizational) of the requests. The goal of this project is to analyse user behaviour by mining enriched web access log data. With the continued growth and proliferation of e-commerce, Web services, and Web-based information systems, the volumes of click stream and user data collected by Web-based organizations in their daily operations has reached astronomical proportions. This information can be exploited in various ways, such as enhancing the effectiveness of websites or developing directed web marketing campaigns. The discovered patterns are usually represented as collections of pages, objects, or re-sources that are frequently accessed by groups of users with common needs or interests. The focus of this paper is to provide an overview how to use frequent pattern techniques for discovering different types of patterns in a Web log database. In this paper we will focus on finding association as a data mining technique to extract potentially useful knowledge from web usage data. I implemented in Java, using NetBeans IDE, a program for identification of pages’ association from sessions. For exemplification, we used the log files from a commercial web site.http://www.ann.ugal.ro/eco/Doc2011_2/ClaudiaDinuca.pdfClickstream analysisWeb server logsAssociation rulesSessions identificationApriori algorithmWeb usage miningSequence rule mining
spellingShingle Claudia Elena DINUCA
Association and Sequence Mining in Web Usage
Annals of Dunarea de Jos University. Fascicle I : Economics and Applied Informatics
Clickstream analysis
Web server logs
Association rules
Sessions identification
Apriori algorithm
Web usage mining
Sequence rule mining
title Association and Sequence Mining in Web Usage
title_full Association and Sequence Mining in Web Usage
title_fullStr Association and Sequence Mining in Web Usage
title_full_unstemmed Association and Sequence Mining in Web Usage
title_short Association and Sequence Mining in Web Usage
title_sort association and sequence mining in web usage
topic Clickstream analysis
Web server logs
Association rules
Sessions identification
Apriori algorithm
Web usage mining
Sequence rule mining
url http://www.ann.ugal.ro/eco/Doc2011_2/ClaudiaDinuca.pdf
work_keys_str_mv AT claudiaelenadinuca associationandsequencemininginwebusage