Learning From Heterogeneous Data Based on Social Interactions Over Graphs

Bordignon, Virginia; Vlaski, Stefan; Matta, Vincenzo; Sayed, Ali H.

doi:10.1109/TIT.2022.3232368

research article

Learning From Heterogeneous Data Based on Social Interactions Over Graphs

Bordignon, Virginia

•

Vlaski, Stefan

•

Matta, Vincenzo

May 1, 2023

Ieee Transactions On Information Theory

This work proposes a decentralized architecture, where individual agents aim at solving a classification problem while observing streaming features of different dimensions and arising from possibly different distributions. In the context of social learning, several useful strategies have been developed, which solve decision making problems through local cooperation across distributed agents and allow them to learn from streaming data. However, traditional social learning strategies rely on the fundamental assumption that each agent has significant prior knowledge of the underlying distribution of the observations. In this work we overcome this issue by introducing a machine learning framework that exploits social interactions over a graph, leading to a fully data-driven solution to the distributed classification problem. In the proposed social machine learning (SML) strategy, two phases are present: in the training phase, classifiers are independently trained to generate a belief over a set of hypotheses using a finite number of training samples; in the prediction phase, classifiers evaluate streaming unlabeled observations and share their instantaneous beliefs with neighboring classifiers. We show that the SML strategy enables the agents to learn consistently under this highly-heterogeneous setting and allows the network to continue learning even during the prediction phase when it is deciding on unlabeled samples. The prediction decisions are used to continually improve performance thereafter in a manner that is markedly different from most existing static classification schemes where, following training, the decisions on unlabeled data are not re-used to improve future performance.

Type

research article

DOI

10.1109/TIT.2022.3232368

Web of Science ID

WOS:000976117600036

Author(s)

Bordignon, Virginia

Vlaski, Stefan

Matta, Vincenzo

Sayed, Ali H.

Date Issued

2023-05-01

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Published in

Ieee Transactions On Information Theory

Volume

69

Issue

5

Start page

3347

End page

3371

Subjects

Computer Science, Information Systems

•

Engineering, Electrical & Electronic

•

Computer Science

•

Engineering

•

training

•

data models

•

random variables

•

videos

•

predictive models

•

machine learning algorithms

•

distributed databases

•

distributed classification

•

social learning

•

neural networks

•

diffusion strategies

•

distributed detection

•

multiple sensors

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units

ASL

Available on Infoscience

June 5, 2023

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/198020