Fast algorithms for hiding sensitive high-utility itemsets in privacy-preserving utility mining
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F61989100%3A27240%2F16%3A86098016" target="_blank" >RIV/61989100:27240/16:86098016 - isvavai.cz</a>
Result on the web
<a href="http://www.sciencedirect.com/science/article/pii/S0952197616301282" target="_blank" >http://www.sciencedirect.com/science/article/pii/S0952197616301282</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1016/j.engappai.2016.07.003" target="_blank" >10.1016/j.engappai.2016.07.003</a>
Alternative languages
Result language
angličtina
Original language name
Fast algorithms for hiding sensitive high-utility itemsets in privacy-preserving utility mining
Original language description
High-Utility Itemset Mining (HUIM) is an extension of frequent itemset mining, which discovers itemsets yielding a high profit in transaction databases (Wits). In recent years, a major issue that has arisen is that data publicly published or shared by organizations may lead to privacy threats since sensitive or confidential information may be uncovered by data mining techniques. To address this issue, techniques for privacy-preserving data mining (PPDM) have been proposed. Recently, privacy-preserving utility mining (PPUM) has become an important topic in PPDM. PPUM is the process of hiding sensitive HUIs (SHUIs) appearing in a database, such that the resulting sanitized database will not reveal these itemsets. In the past, the HHUIF and MSICF algorithms were proposed to hide SHUIs, and are the state-of-the-art approaches for PPUM. In this paper, two novel algorithms, namely Maximum Sensitive Utility-MAximum item Utility (MSU-MAU) and Maximum Sensitive Utility-Minimum item Utility (MSU-MIU), are respectively proposed to minimize the side effects of the sanitization process for hiding SHUIs. The proposed algorithms are designed to efficiently delete SHUIs or decrease their utilities using the concepts of maximum and minimum utility. A projection mechanism is also adopted in the two designed algorithms to speed up the sanitization process. Besides, since the evaluation criteria proposed for PPDM are insufficient and inappropriate for evaluating the sanitization performed by PPUM algorithms, this paper introduces three similarity measures to respectively assess the database structure, database utility and item utility of a sanitized database. These criteria are proposed as a new evaluation standard for PPUM.
Czech name
—
Czech description
—
Classification
Type
J<sub>x</sub> - Unclassified - Peer-reviewed scientific article (Jimp, Jsc and Jost)
CEP classification
IN - Informatics
OECD FORD branch
—
Result continuities
Project
—
Continuities
S - Specificky vyzkum na vysokych skolach
Others
Publication year
2016
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE
ISSN
0952-1976
e-ISSN
—
Volume of the periodical
55
Issue of the periodical within the volume
October
Country of publishing house
GB - UNITED KINGDOM
Number of pages
16
Pages from-to
269-284
UT code for WoS article
000383811200022
EID of the result in the Scopus database
2-s2.0-84979774559