2022
DOI: 10.3389/fninf.2021.785068
|View full text |Cite
|
Sign up to set email alerts
|

Routing Brain Traffic Through the Von Neumann Bottleneck: Parallel Sorting and Refactoring

Abstract: Generic simulation code for spiking neuronal networks spends the major part of the time in the phase where spikes have arrived at a compute node and need to be delivered to their target neurons. These spikes were emitted over the last interval between communication steps by source neurons distributed across many compute nodes and are inherently irregular and unsorted with respect to their targets. For finding those targets, the spikes need to be dispatched to a three-dimensional data structure with decisions o… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2

Relationship

2
6

Authors

Journals

citations
Cited by 8 publications
(8 citation statements)
references
References 39 publications
0
8
0
Order By: Relevance
“…The necessity of modernizing scientific codes is increasingly recognized (Neely et al, 2017 ; de Verdière, 2020 ) and does not spare brain simulator software projects (Brette et al, 2007 ). In the case of NEST these modernizations happened over the past few years, spanning a wide range of both algorithmic and technical improvements (Pronold et al, 2022 ). Others, such as the Brian Simulator (Goodman, 2009 ), have decided to rewrite their codes from scratch, taking the opportunity to overcome limitations of their previous implementations, such as allowing for flexibility in model specification while improving simulator performance (Stimberg et al, 2019 ).…”
Section: Introductionmentioning
confidence: 99%
“…The necessity of modernizing scientific codes is increasingly recognized (Neely et al, 2017 ; de Verdière, 2020 ) and does not spare brain simulator software projects (Brette et al, 2007 ). In the case of NEST these modernizations happened over the past few years, spanning a wide range of both algorithmic and technical improvements (Pronold et al, 2022 ). Others, such as the Brian Simulator (Goodman, 2009 ), have decided to rewrite their codes from scratch, taking the opportunity to overcome limitations of their previous implementations, such as allowing for flexibility in model specification while improving simulator performance (Stimberg et al, 2019 ).…”
Section: Introductionmentioning
confidence: 99%
“…However, the reduced communication and update times here come at the cost of increased delivery times due to an additional indirection introduced with spike compression. Ongoing work targets the delivery phase (Pronold et al, 2021 , 2022 ) and gives a perspective for performance improvements in future releases.…”
Section: Resultsmentioning
confidence: 99%
“…We distinguish between simulators that run on conventional HPC systems and those that use dedicated neuromorphic hardware. Prominent examples of simulators for networks of spiking point-neurons are (Morrison et al, 2005b ; Gewaltig and Diesmann, 2007 ; Plesser et al, 2007 ; Helias et al, 2012 ; Kunkel et al, 2012 , 2014 ; Ippen et al, 2017 ; Kunkel and Schenck, 2017 ; Jordan et al, 2018 ; Pronold et al, 2021 , 2022 ) and (Goodman and Brette, 2008 ; Stimberg et al, 2019 ) using CPUs; (Yavuz et al, 2016 ; Knight and Nowotny, 2018 , 2021 ; Stimberg et al, 2020 ; Knight et al, 2021 ) and (Golosio et al, 2021 ) using GPUs; (Nageswaran et al, 2009 ; Richert et al, 2011 ; Beyeler et al, 2015 ; Chou et al, 2018 ) running on heterogeneous clusters; and the neuromorphic hardware (Furber et al, 2014 ; Rhodes et al, 2019 ). (Carnevale and Hines, 2006 ; Migliore et al, 2006 ; Lytton et al, 2016 ) and (Akar et al, 2019 ) aim for simulating morphologically detailed neuronal networks.…”
Section: Introductionmentioning
confidence: 99%
“…For example, in large-scale networks, synaptic processing substantially dominates the computational load, and the irregular, random access pattern in retrieving the presynaptic data reduce a processor's cache hit rate and increases data access latencies (see e.g., Kunkel et al, 2014 ). The tools of trade here are algorithms that implement high parallelism in computations, “cache-friendly” data structures, and the application of techniques for latency hiding, such as data prefetching (Pronold et al, 2022 ). The proposed HNC node design aims to address these problems—which on conventional computer architectures are a consequence of the von Neumann bottleneck—by implementing performance-critical tasks in hardware.…”
Section: Discussionmentioning
confidence: 99%