Quantitative and Qualitative Evaluation of Sequence Patterns Found by Application of Different Educational Data Preprocessing Techniques
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216275%3A25410%2F17%3A39911978" target="_blank" >RIV/00216275:25410/17:39911978 - isvavai.cz</a>
Result on the web
<a href="http://dx.doi.org/10.1109/ACCESS.2017.2706302" target="_blank" >http://dx.doi.org/10.1109/ACCESS.2017.2706302</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1109/ACCESS.2017.2706302" target="_blank" >10.1109/ACCESS.2017.2706302</a>
Alternative languages
Result language
angličtina
Original language name
Quantitative and Qualitative Evaluation of Sequence Patterns Found by Application of Different Educational Data Preprocessing Techniques
Original language description
Educational data preprocessing from log files represents a time-consuming phase of the knowledge discovery process. It consists of data cleaning, user identification, session identification, and path completion phase. This paper attempts to identify phases, which are necessary in the case of preprocessing of educational data for further application of learning analytics methods. Since the sequential patterns analysis is considered suitable for estimating of discovered knowledge, this paper tries answering the question, which of these preprocessing phases has a significant impact on discovered knowledge in general, as well as in the meaning of quality and quantity of found sequence patterns. Therefore, several data preprocessing techniques for session identification and path completion were applied to prepare logfiles with different levels of data preprocessing. The results showed that the session identification technique using the reference length, calculated from the sitemap, had a significant impact on the quality of extracted sequence rules. The path completion technique had a significant impact only on the quantity of extracted sequence rules. The found results together with the results of the previous systematic research in educational data preprocessing can improve the automation of the educational data preprocessing phase as well as it can contribute to the development of learning analytics tools suitable for different groups of stakeholders engaged in the educational data mining research activities.
Czech name
—
Czech description
—
Classification
Type
J<sub>imp</sub> - Article in a specialist periodical, which is included in the Web of Science database
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
—
Continuities
S - Specificky vyzkum na vysokych skolach
Others
Publication year
2017
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
IEEE ACCESS
ISSN
2169-3536
e-ISSN
—
Volume of the periodical
5
Issue of the periodical within the volume
5
Country of publishing house
US - UNITED STATES
Number of pages
16
Pages from-to
8989-9004
UT code for WoS article
000404270600033
EID of the result in the Scopus database
—