Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. Scalable Robust Graph Embedding with Spark
 
research article

Scalable Robust Graph Embedding with Spark

Chi Thang Duong  
•
Trung Dung Hoang
•
Yin, Hongzhi
Show more
December 1, 2021
Proceedings Of The Vldb Endowment

Graph embedding aims at learning a vector-based representation of vertices that incorporates the structure of the graph. This representation then enables inference of graph properties. Existing graph embedding techniques, however, do not scale well to large graphs. While several techniques to scale graph embedding using compute clusters have been proposed, they require continuous communication between the compute nodes and cannot handle node failure. We therefore propose a framework for scalable and robust graph embedding based on the MapReduce model, which can distribute any existing embedding technique. Our method splits a graph into subgraphs to learn their embeddings in isolation and subsequently reconciles the embedding spaces derived for the subgraphs. We realize this idea through a novel distributed graph decomposition algorithm. In addition, we show how to implement our framework in Spark to enable efficient learning of effective embeddings. Experimental results illustrate that our approach scales well, while largely maintaining the embedding quality.

  • Details
  • Metrics
Type
research article
DOI
10.14778/3503585.3503599
Web of Science ID

WOS:000850111200013

Author(s)
Chi Thang Duong  
Trung Dung Hoang
Yin, Hongzhi
Weidlich, Matthias
Quoc Viet Hung Nguyen  
Aberer, Karl  
Date Issued

2021-12-01

Publisher

ASSOC COMPUTING MACHINERY

Published in
Proceedings Of The Vldb Endowment
Volume

15

Issue

4

Start page

914

End page

922

Subjects

Computer Science, Information Systems

•

Computer Science, Theory & Methods

•

Computer Science

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
LSIR  
Available on Infoscience
September 26, 2022
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/191033
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés