A Methodology for Profiling and Partitioning Stream Programs on Many-core Architectures

Maximizing the data throughput is a very common implementation objective for several streaming applications. Such task is particularly challenging for implementations based on many-core and multi-core target platforms because, in general, it implies tackling several NP- complete combinatorial problems. Moreover, an efficient design space exploration requires an accurate evaluation on the basis of dataflow program execution profiling. The focus of the paper is on the methodology challenges for obtaining accurate profiling measures. Experimental results validate a many-core platform built by an array of Transport Triggered Architecture processors for exploring the partitioning search space based on the execution trace analysis.

Published in:
Procedia Computer Science, 51, 2962-2966
Presented at:
International Conference on Computational Science (ICCS), Reykjavik, Iceland, June 1-3, 2015

Note: The status of this file is: Anyone

 Record created 2015-07-02, last modified 2020-10-28

Publisher's version:
Download fulltext

Rate this document:

Rate this document:
(Not yet reviewed)