Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. GoldFinger: Fast & Approximate Jaccard for Efficient KNN Graph Constructions
 
research article

GoldFinger: Fast & Approximate Jaccard for Efficient KNN Graph Constructions

Guerraoui, Rachid  
•
Kermarrec, Anne-Marie  
•
Niot, Guilhem
Show more
November 1, 2023
Ieee Transactions On Knowledge And Data Engineering

We propose GoldFinger, a new compact and fast-to-compute binary representation of datasets to approximate Jaccard's index. We illustrate the effectiveness of GoldFinger on the emblematic big data problem of K-Nearest-Neighbor (KNN) graph construction and show that GoldFinger can drastically accelerate a large range of existing KNN algorithms with little to no overhead. As a side effect, we also show that the compact representation of the data protects users' privacy for free by providing k-anonymity and l-diversity. Our extensive evaluation of the resulting approach on several realistic datasets shows that our approach reduces computation times by up to 78.9% compared to raw data while only incurring a negligible to moderate loss in terms of KNN quality. We also show that GoldFinger can be applied to KNN queries (a widely-used search technique) and delivers speedups of up to x3.55 over one of the most efficient approaches to this problem.

  • Details
  • Metrics
Type
research article
DOI
10.1109/TKDE.2022.3232689
Web of Science ID

WOS:001089176900037

Author(s)
Guerraoui, Rachid  
Kermarrec, Anne-Marie  
Niot, Guilhem
Ruas, Olivier
Taiani, Francois
Date Issued

2023-11-01

Publisher

Ieee Computer Soc

Published in
Ieee Transactions On Knowledge And Data Engineering
Volume

35

Issue

11

Start page

11461

End page

11475

Subjects

Technology

•

Knn Graphs

•

Fingerprint

•

Similarity

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

FunderGrant Number

PAMELA project of the French National Research Agency

ANR-16-CE23-0016

Available on Infoscience
February 19, 2024
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/204055
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés