Influence of ratio of auxiliary pages on the pre-processing phase of web usage mining
Result identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23510%2F15%3A43925834" target="_blank" >RIV/49777513:23510/15:43925834 - isvavai.cz</a>
Alternative codes found
RIV/00216275:25410/15:39902624
Result on the web
<a href="http://dx.doi.org/10.15240/tul/001/2015-3-013" target="_blank" >http://dx.doi.org/10.15240/tul/001/2015-3-013</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.15240/tul/001/2015-3-013" target="_blank" >10.15240/tul/001/2015-3-013</a>
Alternative languages
Result language
English
Title in original language
Influence of ratio of auxiliary pages on the pre-processing phase of web usage mining
Result description in original language
Data mining is one of the important tools of Business Intelligence and a means of increasing a company's competitiveness. Web usage mining applies data mining to web server log files in order to analyze user behavior on a web site. The first step of the web usage mining process is pre-processing the data obtained from a web log file. Data pre-processing is an important part of web usage mining: the discovery of visitors' behavior patterns depends on the quality of the pre-processing phase, so it is important to understand the methods used. This paper summarizes the pre-processing phases, with particular attention to session identification. It introduces two algorithms for data cleaning and session identification using the reference length method. The main aim of the paper is to compare calculations of the cutoff time and their influence on the discovered useful, trivial and inexplicable rules. Cutoff time is an important part of the session identification using the
Classification
Type
J<sub>x</sub> - Unclassified - Article in a scholarly periodical (Jimp, Jsc and Jost)
CEP field
IN - Informatics
OECD FORD field
—
Result linkages
Project
—
Linkages
I - Institutional support for the long-term conceptual development of a research organization
Other
Year of implementation
2015
Data confidentiality code
S - Complete and truthful data on the project are not subject to protection under special legal regulations
Data specific to the result type
Periodical name
E + M. Ekonomie a Management
ISSN
1212-3609
e-ISSN
—
Periodical volume
18
Issue number within the volume
3
Country of the periodical's publisher
CZ - Czech Republic
Number of pages
16
Pages from-to
144-159
UT WoS code of the article
000361504100013
EID of the result in the Scopus database
—