Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Conferences, Workshops, Symposiums, and Seminars
  4. Fully-Asynchronous Cache-Efficient Simulation of Detailed Neural Networks
 
conference paper

Fully-Asynchronous Cache-Efficient Simulation of Detailed Neural Networks

Magalhaes, Bruno R. C.
•
Sterling, Thomas
•
Hines, Michael
Show more
January 1, 2019
Computational Science - Iccs 2019, Pt Iii
19th Annual International Conference on Computational Science (ICCS)

Modern asynchronous runtime systems allow the re-thinking of large-scale scientific applications. With the example of a simulator of morphologically detailed neural networks, we show how detaching from the commonly used bulk-synchronous parallel (BSP) execution allows for the increase of prefetching capabilities, better cache locality, and a overlap of computation and communication, consequently leading to a lower time to solution. Our strategy removes the operation of collective synchronization of ODEs' coupling information, and takes advantage of the pairwise time dependency between equations, leading to a fully-asynchronous exhaustive yet not speculative stepping model. Combined with fully linear data structures, communication reduce at compute node level, and an earliest equation steps first scheduler, we perform an acceleration at the cache level that reduces communication and time to solution by maximizing the number of timesteps taken per neuron at each iteration.

Our methods were implemented on the core kernel of the NEURON scientific application. Asynchronicity and distributed memory space are provided by the HPX runtime system for the ParalleX execution model. Benchmark results demonstrate a superlinear speed-up that leads to a reduced runtime compared to the bulk synchronous execution, yielding a speed-up between 25% to 65% across different compute architectures, and in the order of 15% to 40% for distributed executions.

  • Details
  • Metrics
Type
conference paper
DOI
10.1007/978-3-030-22744-9_33
Web of Science ID

WOS:000589293800033

Author(s)
Magalhaes, Bruno R. C.
Sterling, Thomas
Hines, Michael
Schurmann, Felix  
Date Issued

2019-01-01

Publisher

SPRINGER INTERNATIONAL PUBLISHING AG

Publisher place

Cham

Published in
Computational Science - Iccs 2019, Pt Iii
ISBN of the book

978-3-030-22744-9

978-3-030-22743-2

Series title/Series vol.

Lecture Notes in Computer Science

Volume

11538

Start page

421

End page

434

Subjects

Computer Science, Theory & Methods

•

Mathematical & Computational Biology

•

Mathematics, Interdisciplinary Applications

•

Computer Science

•

Mathematics

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

EPFL units
GR-FSCH  
BBP-CORE  
Event nameEvent placeEvent date
19th Annual International Conference on Computational Science (ICCS)

Faro, PORTUGAL

Jun 12-14, 2019

Available on Infoscience
December 1, 2020
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/173737
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés