2023
DOI: 10.1126/science.adh1174
|View full text |Cite
|
Sign up to set email alerts
|

Neural inference at the frontier of energy, space, and time

Dharmendra S. Modha,
Filipp Akopyan,
Alexander Andreopoulos
et al.

Abstract: Computing, since its inception, has been processor-centric, with memory separated from compute. Inspired by the organic brain and optimized for inorganic silicon, NorthPole is a neural inference architecture that blurs this boundary by eliminating off-chip memory, intertwining compute with memory on-chip, and appearing externally as an active memory chip. NorthPole is a low-precision, massively parallel, densely interconnected, energy-efficient, and spatial computing architecture with a co-optimized, high-util… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
6
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 33 publications
(7 citation statements)
references
References 47 publications
1
6
0
Order By: Relevance
“…We optimize the overall system timing to minimize the latency, but the NG-RC model and control law is so simple that the evaluation time is two orders of magnitude shorter than the other latencies present in the controller, which are detailed Supplementary Note 7 . Another important aspect of our controller is that we use fixed-point arithmetic, which is matched to the digitized input data and reduces the required compute resources and power consumption 10 .…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…We optimize the overall system timing to minimize the latency, but the NG-RC model and control law is so simple that the evaluation time is two orders of magnitude shorter than the other latencies present in the controller, which are detailed Supplementary Note 7 . Another important aspect of our controller is that we use fixed-point arithmetic, which is matched to the digitized input data and reduces the required compute resources and power consumption 10 .…”
Section: Resultsmentioning
confidence: 99%
“…Regarding the power consumption, it can be likely be reduced to the sub-mW level using a more power-efficient FPGA on a custom circuit board without extraneous components. Because our algorithm only requires a small number of multiplications and additions, the model can be evaluated efficiently on the recent custom artificial-intelligence processors 10 , 11 or using a custom application-specific integrated circuit to achieve the lowest power consumption.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Current approaches relying on artificial neural networks (ANNs) have achieved exceptional performance in complex tasks like image recognition and text generation. , However, this comes at the cost of significantly increased energy consumption due to the inefficiency of computers based on the von Neumann paradigm . To mitigate the energy footprint of ANNs, there is a growing focus on developing computational strategies that mimic the capabilities of biological neural networks .…”
Section: Introductionmentioning
confidence: 99%
“…With immense research effort, CMOS-based neuromorphic chip designs like the TrueNorth [ 2 ], BrainScales [ 3 ], and Loihi 2 [ 4 ] have achieved break-through performance records. In 2023, IBM released the NorthPole chip [ 5 ], which is reported to be 25 times more energy efficient than corresponding graphical processing units (GPUs) in handling deep learning workloads. But the overall energy consumption and chip footprint still need to be considerably lowered.…”
Section: Introductionmentioning
confidence: 99%