Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Auto-Partitioning Heterogeneous Task-Parallel Programs with StreamBlocks
 
conference paper

Auto-Partitioning Heterogeneous Task-Parallel Programs with StreamBlocks

Emami, Mahyar  
•
Bezati, Endri
•
Janneck, Jorn W.
Show more
January 1, 2022
Proceedings Of The 2022 31St International Conference On Parallel Architectures And Compilation Techniques, Pact 2022
31st International Conference on Parallel Architectures and Compilation Techniques (PACT)

FPGAs play an increasing role in the reconfigurable accelerator landscape. A key challenge in designing FPGA-based systems is partitioning computation between processor cores and FPGAs. An appropriate division of labor is difficult to predict in advance and requires experiments and measurements. When an investigation requires rewriting part of the system in a new language or with a new programming model, its high cost can delay design-space exploration. A single-language system with an appropriate programming model and compiler that targets both platforms transforms this tedious exploration to a simple recompile with new compiler directives. This work introduces StreamBlocks, a unified open-source software/FPGA compiler and runtime that takes dataflow programs written in Cal, and automatically partitions them across heterogeneous CPU/FPGA platforms. The explicit task-parallel semantics of dataflow allows our compiler to simultaneously take advantage of thread parallelism on software and spatial parallelism on hardware. StreamBlocks is augmented with a profile-guided autopartitioning tool that helps identify the best hardware-software partitions. We demonstrate the capability of our compiler in finding the right balance between hardware and software execution on both a high-end datacenter accelerator card and an embedded board. Our experiments exhibit a 4 - 7x speedup over trivial partitions. This speedup is achieved automatically with zero code modifications.

  • Details
  • Metrics
Type
conference paper
DOI
10.1145/3559009.3569659
Web of Science ID

WOS:001071492700030

Author(s)
Emami, Mahyar  
Bezati, Endri
Janneck, Jorn W.
Larus, James R.  
Date Issued

2022-01-01

Publisher

ASSOC COMPUTING MACHINERY

Publisher place

New York

Published in
Proceedings Of The 2022 31St International Conference On Parallel Architectures And Compilation Techniques, Pact 2022
ISBN of the book

978-1-4503-9868-8

Start page

398

End page

411

Subjects

reconfigurable computing

•

actors

•

partitioning

•

data-flow programs

•

design

•

hardware

•

compiler

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
DCSL  
Event nameEvent placeEvent date
31st International Conference on Parallel Architectures and Compilation Techniques (PACT)

Chicago, IL

Oct 08-12, 2022

Available on Infoscience
October 23, 2023
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/201757
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés