2022
DOI: 10.48550/arxiv.2201.08656
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Dustin: A 16-Cores Parallel Ultra-Low-Power Cluster with 2b-to-32b Fully Flexible Bit-Precision and Vector Lockstep Execution Mode

Abstract: Computationally intensive algorithms such as Deep Neural Networks (DNNs) are becoming killer applications for edge devices. Porting heavily data-parallel algorithms on resource-constrained and battery-powered devices poses several challenges related to memory footprint, computational throughput, and energy efficiency. Low-bitwidth and mixed-precision arithmetic have been proven to be valid strategies for tackling these problems. We present Dustin, a fully programmable compute cluster integrating 16 RISC-V core… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

1
0

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 24 publications
0
1
0
Order By: Relevance
“…Finally, we compare DARKSIDE with two SoCs that exploit a similar architectural template: Dustin [37] and Vega [23] implement a multi-core RISC-V compute cluster in 65nm and 22nm, respectively. Compared to Vega [23], Darkside delivers better performance on 8-bit integer workloads thanks to the M&L instruction.…”
Section: Comparison With the State-of-the-artmentioning
confidence: 99%
“…Finally, we compare DARKSIDE with two SoCs that exploit a similar architectural template: Dustin [37] and Vega [23] implement a multi-core RISC-V compute cluster in 65nm and 22nm, respectively. Compared to Vega [23], Darkside delivers better performance on 8-bit integer workloads thanks to the M&L instruction.…”
Section: Comparison With the State-of-the-artmentioning
confidence: 99%