Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. Learning From Heterogeneous Data Based on Social Interactions Over Graphs
 
research article

Learning From Heterogeneous Data Based on Social Interactions Over Graphs

Bordignon, Virginia  
•
Vlaski, Stefan  
•
Matta, Vincenzo
Show more
May 1, 2023
Ieee Transactions On Information Theory

This work proposes a decentralized architecture, where individual agents aim at solving a classification problem while observing streaming features of different dimensions and arising from possibly different distributions. In the context of social learning, several useful strategies have been developed, which solve decision making problems through local cooperation across distributed agents and allow them to learn from streaming data. However, traditional social learning strategies rely on the fundamental assumption that each agent has significant prior knowledge of the underlying distribution of the observations. In this work we overcome this issue by introducing a machine learning framework that exploits social interactions over a graph, leading to a fully data-driven solution to the distributed classification problem. In the proposed social machine learning (SML) strategy, two phases are present: in the training phase, classifiers are independently trained to generate a belief over a set of hypotheses using a finite number of training samples; in the prediction phase, classifiers evaluate streaming unlabeled observations and share their instantaneous beliefs with neighboring classifiers. We show that the SML strategy enables the agents to learn consistently under this highly-heterogeneous setting and allows the network to continue learning even during the prediction phase when it is deciding on unlabeled samples. The prediction decisions are used to continually improve performance thereafter in a manner that is markedly different from most existing static classification schemes where, following training, the decisions on unlabeled data are not re-used to improve future performance.

  • Details
  • Metrics
Type
research article
DOI
10.1109/TIT.2022.3232368
Web of Science ID

WOS:000976117600036

Author(s)
Bordignon, Virginia  
Vlaski, Stefan  
Matta, Vincenzo
Sayed, Ali H.  
Date Issued

2023-05-01

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Published in
Ieee Transactions On Information Theory
Volume

69

Issue

5

Start page

3347

End page

3371

Subjects

Computer Science, Information Systems

•

Engineering, Electrical & Electronic

•

Computer Science

•

Engineering

•

training

•

data models

•

random variables

•

videos

•

predictive models

•

machine learning algorithms

•

distributed databases

•

distributed classification

•

social learning

•

neural networks

•

diffusion strategies

•

distributed detection

•

multiple sensors

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
ASL  
Available on Infoscience
June 5, 2023
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/198020
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés