2021
DOI: 10.1109/tc.2020.2987314
|View full text |Cite
|
Sign up to set email alerts
|

Stream Semantic Registers: A Lightweight RISC-V ISA Extension Achieving Full Compute Utilization in Single-Issue Cores

Abstract: Single-issue processor cores are very energy efficient but suffer from the von Neumann bottleneck, in that they must explicitly fetch and issue the loads/storse necessary to feed their ALU/FPU. Each instruction spent on moving data is a cycle not spent on computation, limiting ALU/FPU utilization to 33% on reductions. We propose "Stream Semantic Registers" to boost utilization and increase energy efficiency. SSR is a lightweight, non-invasive RISC-V ISA extension which implicitly encodes memory accesses as reg… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
20
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
6
1

Relationship

1
6

Authors

Journals

citations
Cited by 26 publications
(20 citation statements)
references
References 29 publications
0
20
0
Order By: Relevance
“…In order to store the intermediate data, like the modern processor architectures [15], [18], the ART-9 core also includes a ternary registerfile (TRF) including nine generalpurposed registers, each of which is accessed by using a 2trit value. Utilizing the load-store architecture used for typical RISC processors [19], there are four instruction categories in ART-9 ISA; R-type, I-type, B-type, and M-type.…”
Section: A Art-9 Instruction Set Architecturementioning
confidence: 99%
See 1 more Smart Citation
“…In order to store the intermediate data, like the modern processor architectures [15], [18], the ART-9 core also includes a ternary registerfile (TRF) including nine generalpurposed registers, each of which is accessed by using a 2trit value. Utilizing the load-store architecture used for typical RISC processors [19], there are four instruction categories in ART-9 ISA; R-type, I-type, B-type, and M-type.…”
Section: A Art-9 Instruction Set Architecturementioning
confidence: 99%
“…As shown in Fig. 4, similar to the lightweight RISC-type designs [19], there are five stages for fetching the instruction from TIM (IF), decoding the fetched instruction (ID), executing the arithmetic/logical operations (EX), accessing the TDM (MEM), and updating the result to TRF (WB). The ternary pipelined registers are newly developed to keep the results from each stage, making a balanced pipelined processing.…”
Section: B 5-stage Pipelined Art-9 Architecturementioning
confidence: 99%
“…Aiming to maximize the compute/control ratio (making the FPU the dominant part of the design) mitigating the effects of deep pipelines and dynamic scheduling. 2) An ISA extension, originally proposed by Schuiki et al [18], called stream semantic register (SSR). This extension accelerates data-oblivious [19] problems by providing an efficient semantic to read and write from memory.…”
Section: Contributionsmentioning
confidence: 99%
“…Achieving 3.5× more energy efficiency and 4.5× better FPU utilization on small matrices than the current state of the art. 2) An implementation of the SSR [18] enhanced with shadow registers to allow overlapping loop-setup with ongoing operations using the FREP extension enabling the usage of our SSR and FREP extensions on more irregular kernels such as Fast Fourier Transform (FFT). Achieving speed-ups of 4.7× in the single-core case and close to 3× in the parallel octa-core case for the FFT benchmark.…”
Section: Contributionsmentioning
confidence: 99%
See 1 more Smart Citation