Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Cluster-and-Conquer: When Randomness Meets Graph Locality
 
conference paper

Cluster-and-Conquer: When Randomness Meets Graph Locality

Giakkoupis, George
•
Kermarrec, Anne-Marie  
•
Ruas, Olivier
Show more
January 1, 2021
2021 Ieee 37Th International Conference On Data Engineering (Icde 2021)
37th IEEE International Conference on Data Engineering (IEEE ICDE)

K-Nearest-Neighbors (KNN) graphs are central to many emblematic data mining and machine-learning applications. Some of the most efficient KNN graph algorithms are incremental and local: they start from a random graph, which they incrementally improve by traversing neighbors-of-neighbors links. Unfortunately, the initial random graph exhibits a poor graph locality, leading to many unnecessary similarity computations. In this paper, we remove this drawback with Cluster-and-Conquer (C-2 for short). Cluster-and-Conquer boosts the starting configuration of greedy algorithms thanks to a novel lightweight clustering mechanism, dubbed FastRandomHash. FastRandomHash leverages randomness and recursion to precluster similar nodes at a very low cost. Our extensive evaluation on real datasets shows that Cluster-and-Conquer significantly outperforms existing approaches, including LSH, yielding speedups of up to x4.42 and even improving the KNN quality.

  • Details
  • Metrics
Type
conference paper
DOI
10.1109/ICDE51399.2021.00195
Web of Science ID

WOS:000687830800187

Author(s)
Giakkoupis, George
Kermarrec, Anne-Marie  
Ruas, Olivier
Taiani, Francois
Date Issued

2021-01-01

Publisher

IEEE COMPUTER SOC

Publisher place

Los Alamitos

Published in
2021 Ieee 37Th International Conference On Data Engineering (Icde 2021)
ISBN of the book

978-1-7281-9184-3

Series title/Series vol.

IEEE International Conference on Data Engineering

Start page

2027

End page

2032

Subjects

Computer Science, Information Systems

•

Computer Science, Theory & Methods

•

Computer Science

•

knn graph

•

big data

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
SACS  
Event nameEvent placeEvent date
37th IEEE International Conference on Data Engineering (IEEE ICDE)

ELECTR NETWORK

Apr 19-22, 2021

Available on Infoscience
September 25, 2021
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/181713
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés