Parallelizing Machine Learning- Functionally: A Framework and Abstractions for Parallel Graph Processing

Haller, Philipp; Miller, Heather

2011

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DublinCore
EndNote
NLM
RefWorks
RIS

Files

Abstract

Implementing machine learning algorithms for large data, such as the Web graph and social networks, is challenging. Even though much research has focused on making sequential algorithms more scalable, their running times continue to be prohibitively long. Meanwhile, parallelization remains a formidable challenge for this class of problems, despite frameworks like MapReduce which hide much of the associated complexity. We present a framework for implementing parallel and distributed machine learning algorithms on large graphs, flexibly, through the use of functional programming abstractions. Our aim is a system that allows researchers and practitioners to quickly and easily implement (and experiment with) their algorithms in a parallel or distributed setting. We introduce functional combinators for the flexible composition of parallel, aggregation, and sequential steps. To the best of our knowledge, our system is the first to avoid inversion of control in a (bulk) synchronous parallel model.

Details

Title Parallelizing Machine Learning- Functionally: A Framework and Abstractions for Parallel Graph Processing

Author(s) Haller, Philipp ; Miller, Heather

Conference 2nd Annual Scala Workshop, Stanford, California, USA, June 2, 2011

Date 2011

Keywords

Parallel programming; distributed programming; machine learning; graph processing

Laboratories LAMP

Record Appears in Scientific production and competences > I&C - School of Computer and Communication Sciences > IINFCOM > LAMP - Programming Methods Laboratory
Peer-reviewed publications
Conference Papers
Work produced at EPFL
Accepted

Record creation date 2011-04-17

Files

Abstract

Details

PDF