GPU Acceleration for Simulating Massively Parallel Many-Core Platforms

Raghav, Shivani; Ruggiero, Martino; Marongiu, Andrea; Pinto, Christian; Atienza, David; Benini, Luca

doi:10.1109/Tpds.2014.2319092

research article

GPU Acceleration for Simulating Massively Parallel Many-Core Platforms

Raghav, Shivani

•

Ruggiero, Martino

•

Marongiu, Andrea

more

2015

Ieee Transactions On Parallel And Distributed Systems

Emerging massively parallel architectures such as a general-purpose processor plus many-core programmable accelerators are creating an increasing demand for novel methods to perform their architectural simulation. Most state-of-the-art simulation technologies are exceedingly slow and the need to model full system many-core architectures adds further to the complexity issues. This paper presents a fast, scalable and parallel simulator, which uses a novel methodology to accelerate the simulation of a many-core coprocessor using GPU platforms. The main idea is to use. The target architecture of the associated. Simulation of many target nodes is mapped to the many hardware-threads available on highly parallel GPU platforms. This paper presents a novel methodology to accelerate the simulation of many-core coprocessors using GPU platforms. We demonstrate the challenges, feasibility and benefits of our idea to use heterogeneous system (CPU and GPU) to simulate future architecture of many-core heterogeneous platforms. The target architecture selected to evaluate our methodology consists of an ARM general purpose CPU coupled with many-core coprocessor with thousands of simple in-order cores connected in a tile network. This work presents optimization techniques used to parallelize the simulation specifically for acceleration on GPUs. We partition the full system simulation between CPU and GPU, where the target general purpose CPU is simulated on the host CPU, whereas the many-core coprocessor is simulated on the NVIDIA Tesla 2070 GPU platform. Our experiments show performance of up to 50 MIPS when simulating the entire heterogeneous chip, and high scalability with increasing cores on coprocessor.

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/114219

Name

TDPS2015-06803951.pdf

Type

Publisher's version

Access type

openaccess

Size

1.69 MB

Format

Adobe PDF

Checksum (MD5)

dad9f1f9274b85b370668fd68796e822