Influence of ratio of auxiliary pages on the pre-processing phase of web usage mining
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23510%2F15%3A43925834" target="_blank" >RIV/49777513:23510/15:43925834 - isvavai.cz</a>
Alternative codes found
RIV/00216275:25410/15:39902624
Result on the web
<a href="http://dx.doi.org/10.15240/tul/001/2015-3-013" target="_blank" >http://dx.doi.org/10.15240/tul/001/2015-3-013</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.15240/tul/001/2015-3-013" target="_blank" >10.15240/tul/001/2015-3-013</a>
Alternative languages
Result language
angličtina
Original language name
Influence of ratio of auxiliary pages on the pre-processing phase of web usage mining
Original language description
Data mining belongs to the one of the important tools for Business Intelligence. It is a means to increase competitiveness of a company. Web usage mining is engaged in data mining of web server log file and it analyzes the user's behavior on the web site. The first step of web usage mining process is data pre-processing obtained from a web log file. Data pre-processing is an important part of web usage mining. Discovering patterns of behavior of web visitors depends on the quality of pre-processing phase. Therefore it is important to understand the used methods. This paper summarizes the pre-processing phases and especially the phases of session identification. There are introduced two algorithms for data cleaning and session identification using the reference length method. The main aim of this paper is to compare a calculation of cutoff time and its influence on discovered useful, trivial and inexplicable rules. Cutoff time is an important part of the session identification using the
Czech name
—
Czech description
—
Classification
Type
J<sub>x</sub> - Unclassified - Peer-reviewed scientific article (Jimp, Jsc and Jost)
CEP classification
IN - Informatics
OECD FORD branch
—
Result continuities
Project
—
Continuities
I - Institucionalni podpora na dlouhodoby koncepcni rozvoj vyzkumne organizace
Others
Publication year
2015
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
E + M. Ekonomie a Management
ISSN
1212-3609
e-ISSN
—
Volume of the periodical
18
Issue of the periodical within the volume
3
Country of publishing house
CZ - CZECH REPUBLIC
Number of pages
16
Pages from-to
144-159
UT code for WoS article
000361504100013
EID of the result in the Scopus database
—