Permutation-based Sequential Pattern Hiding

Gwadera, Robert; Gkoulalas-Divanis, Aris; Loukides, Grigorios

doi:10.1109/Icdm.2013.57

conference paper

Permutation-based Sequential Pattern Hiding

Gwadera, Robert

•

Gkoulalas-Divanis, Aris

•

Loukides, Grigorios

2013

2013 Ieee 13Th International Conference On Data Mining (Icdm)

IEEE 13th International Conference on Data Mining (ICDM)

Sequence data are increasingly shared to enable mining applications, in various domains such as marketing, telecommunications, and healthcare. This, however, may expose sensitive sequential patterns, which lead to intrusive inferences about individuals or leak confidential information about organizations. This paper presents the first permutation-based approach to prevent this threat. Our approach hides sensitive patterns by replacing them with carefully selected permutations that avoid changes in the set of frequent nonsensitive patterns (side-effects) and in the ordering information of sequences (distortion). By doing so, it retains data utility in sequence mining and tasks based on itemset properties, as permutation preserves the support of items, unlike deletion, which is used in existing works. To realize our approach, we develop an efficient and effective algorithm for generating permutations with minimal side-effects and distortion. This algorithm also avoids implausible symbol orderings that may exist in certain applications. In addition, we propose a method to hide sensitive patterns from a sequence dataset. Extensive experiments verify that our method allows significantly more accurate data analysis than the state-of-the-art approach.

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/103871

Type

conference paper

DOI

10.1109/Icdm.2013.57

Web of Science ID

WOS:000332874200025

Authors

Gwadera, Robert

•

Gkoulalas-Divanis, Aris

•

Loukides, Grigorios

Editors

Xiong, H.

•

Karypis, G.

•

Thuraisingham, B.

•

Cook, D.

•

Wu, X.

Publication date

2013

Publisher

Ieee

Published in

2013 Ieee 13Th International Conference On Data Mining (Icdm)

Publisher place

New York

Total of pages

10

Series title/Series vol.

IEEE International Conference on Data Mining

Start page

241

End page

250

Peer reviewed

REVIEWED

EPFL units

LSIR

Event name

IEEE 13th International Conference on Data Mining (ICDM)

Available on Infoscience

June 2, 2014