WWW portal usage analysis using genetic algorithms

The article proposes a new method suitable for advanced analysis of web portal visits. This is part of retrieving information and knowledge from web usage data (web usage mining). Such information is necessary to gain better insight into visitors’ needs and consumer behaviour in general. By...

Bibliographic Details
Main Authors: Ondřej Popelka, Jiří Šťastný
Format: Article
Language: English
Published: Mendel University Press 2009-01-01
Series: Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis
Subjects: genetic algorithms; data mining; behaviour patterns; www portal
Online Access: https://acta.mendelu.cz/57/6/0201/
author Ondřej Popelka
Jiří Šťastný
author_facet Ondřej Popelka
Jiří Šťastný
author_sort Ondřej Popelka
collection DOAJ
description The article proposes a new method suitable for advanced analysis of web portal visits. This is part of retrieving information and knowledge from web usage data (web usage mining). Such information is necessary to gain better insight into visitors’ needs and consumer behaviour in general. By leveraging this information, a company can optimize the organization of its internet presentations and offer a better end-user experience. The proposed approach uses grammatical evolution, a computational method based on genetic algorithms. Grammatical evolution uses a context-free grammar to generate the solution in an arbitrary reusable form. This allows us to describe visitors’ behaviour in different ways, depending on the desired further processing. In this article we describe it using a procedural programming language. Web server access log files are used as source data. The extraction of behaviour patterns can currently be solved using statistical analysis, specifically methods based on sequential analysis. Our objective is to develop an alternative algorithm. The article further describes the basic algorithms of two-level grammatical evolution; this involves basic grammatical evolution and differential evolution, which forms the second phase of the computation. Grammatical evolution is used to generate the basic structure of the solution in the form of a part of application code. Differential evolution is used to find optimal parameters for this solution: the specific pages visited by a random visitor. The grammar used to conduct the experiments is described, along with explanations of its links to the actual implementation of the algorithm. Furthermore, the fitness function is described, together with the reasons that led to its current shape. Finally, the process of analyzing and filtering the raw input data is described, as it is a vital part of obtaining reasonable results.
first_indexed 2024-12-14T14:33:52Z
format Article
id doaj.art-46098654fa814f199bbb2a03608f082a
institution Directory Open Access Journal
issn 1211-8516
2464-8310
language English
last_indexed 2024-12-14T14:33:52Z
publishDate 2009-01-01
publisher Mendel University Press
record_format Article
series Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis
spelling doaj.art-46098654fa814f199bbb2a03608f082a
2022-12-21T22:57:42Z
eng
Mendel University Press
Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis
1211-8516
2464-8310
2009-01-01
vol. 57, no. 6, pp. 201-208
10.11118/actaun200957060201
WWW portal usage analysis using genetic algorithms
Ondřej Popelka (Department of Informatics, Mendel University of Agriculture and Forestry in Brno, Zemědělská 1, 613 00 Brno, Czech Republic)
Jiří Šťastný (Department of Informatics, Mendel University of Agriculture and Forestry in Brno, Zemědělská 1, 613 00 Brno, Czech Republic)
https://acta.mendelu.cz/57/6/0201/
genetic algorithms
data mining
behaviour patterns
www portal
title WWW portal usage analysis using genetic algorithms
title_sort www portal usage analysis using genetic algorithms
topic genetic algorithms
data mining
behaviour patterns
www portal
url https://acta.mendelu.cz/57/6/0201/