Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. Right-Protected Data Publishing with Provable Distance-Based Mining
 
research article

Right-Protected Data Publishing with Provable Distance-Based Mining

Zoumpoulis, Spyros I.
•
Vlachos, Michail
•
Freris, Nikolaos M.
Show more
2014
Ieee Transactions On Knowledge And Data Engineering

Protection of one's intellectual property is a topic with important technological and legal facets. We provide mechanisms for establishing the ownership of a dataset consisting of multiple objects. The algorithms also preserve important properties of the dataset, which are important for mining operations, and so guarantee both right protection and utility preservation. We consider a right-protection scheme based on watermarking. Watermarking may distort the original distance graph. Our watermarking methodology preserves important distance relationships, such as: the Nearest Neighbors (NN) of each object and the Minimum Spanning Tree (MST) of the original dataset. This leads to preservation of any mining operation that depends on the ordering of distances between objects, such as NN-search and classification, as well as many visualization techniques. We prove fundamental lower and upper bounds on the distance between objects post-watermarking. In particular, we establish a restricted isometry property, i.e., tight bounds on the contraction/expansion of the original distances. We use this analysis to design fast algorithms for NN-preserving and MST-preserving watermarking that drastically prune the vast search space. We observe two orders of magnitude speedup over the exhaustive schemes, without any sacrifice in NN or MST preservation.

  • Details
  • Metrics
Type
research article
DOI
10.1109/Tkde.2013.90
Web of Science ID

WOS:000341570800015

Author(s)
Zoumpoulis, Spyros I.
Vlachos, Michail
Freris, Nikolaos M.
Lucchese, Claudio
Date Issued

2014

Publisher

Ieee Computer Soc

Published in
Ieee Transactions On Knowledge And Data Engineering
Volume

26

Issue

8

Start page

2014

End page

2028

Subjects

Watermarking

•

nearest neighbors (NN)

•

minimum spanning tree (MST)

•

restricted isometry property (RIP)

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
ISC  
Available on Infoscience
October 23, 2014
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/107806
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés