Hybrid OpenMP/MPI parallelization of the charge deposition step in the global gyrokinetic Particle-In-Cell code ORB5

Lanti, Emmanuel; Scheinberg, Aaron Lewis; Jocksch, Andreas; Ohana, Noé; Brunner, Stephan; Gheller, Claudio; Villard, Laurent

conference presentation

Lanti, Emmanuel

•

Scheinberg, Aaron Lewis

•

Jocksch, Andreas

more

2017

PASC17

Gyrokinetic simulations are computationally extremely demanding due to the high dimensionality of the physical phase space and the interplay between plasma particles and electromagnetic fields. It is thus essential to make full use of the available numerical resources to be able to simulate more complex physical problems. With the aim of optimizing the gyrokinetic Particle-In-Cell code ORB5 towards exascale computing, a particle sorting method is implemented to increase data locality. Furthermore, different algorithms are used to improve vectorization, and the MPI parallelization is complemented with OpenMP. More specifically, we shall focus on the particle to grid operations involved in the PIC charge deposition step. The latter is critical to parallelize using a shared memory paradigm due to the scatter operations involved. We will present the different algorithms and parallelization schemes implemented in the ORB5 charge deposition step and how they affect the speedup compared to the base MPI case.

Name

EmmanuelLanti_PASC17.pdf

Access type

openaccess

Size

3.27 MB

Format

Adobe PDF

Checksum (MD5)

325efab573b1028bd7ea976dac4e062a