Tools and Frameworks for Big Learning in Scala: Leveraging the Language for High Productivity and Performance

Miller, Heather; Haller, Philipp; Odersky, Martin

conference paper not in proceedings

Miller, Heather

•

Haller, Philipp

•

Odersky, Martin

2011

NIPS 2011 Workshop on Parallel and Large-Scale Machine Learning (BigLearn)

Implementing machine learning algorithms for large data, such as the Web graph and social networks, is challenging. Even though much research has focused on making sequential algorithms more scalable, their running times continue to be prohibitively long. Meanwhile, parallelization remains a formidable challenge for this class of problems, despite frameworks like MapReduce which hide much of the associated complexity. We present three ongoing efforts within our team, previously presented at venues in other fields, which aim to make it easier for machine learning researchers and practitioners alike to quickly implement and experiment with their algorithms in a parallel or distributed setting. Furthermore, we hope to highlight some of the language features unique to the Scala programming language in the treatment of our frameworks, in an effort to show how these features can be used to produce efficient and correct parallel systems more easily than ever before.

Name

nips2011.pdf

Access type

openaccess

Size

93.73 KB

Format

Adobe PDF

Checksum (MD5)

f122ab0e2b531eb43180f67ebf1c7925