2014 International Symposium on System-on-Chip (SoC) 2014
DOI: 10.1109/issoc.2014.6972435
|View full text |Cite
|
Sign up to set email alerts
|

An implementation of Auto-Memoization mechanism on ARM-based superscalar processor

Abstract: We have proposed a processor called Auto Memoization Processor which is based on computation reuse. Un til now, we have implemented the auto-memoization mechanism on a single-issue non-pipelined SPARC processor and studied the processor. The processor dynamically detects functions and loop iterations as reusable blocks, and memoizes them automatically.In addition, the processor can apply computation reuse to the blocks with a little reuse overhead. However, the fine evaluation result of the processor may not g… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
5
0
1

Year Published

2016
2016
2022
2022

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(6 citation statements)
references
References 6 publications
0
5
0
1
Order By: Relevance
“…Literature 6 provides a clear distinction between them. But when we compare CR [7][8][9][10][11][12][13][14][15][16]30 and MZ, [17][18][19][20][21][22][23][24][25][26][27][28][29]35,36 we could not found sufficient difference among these two. Conceptually, CR refers to "reusing previously computed (buffered) results if same input appears again," and MZ refers to "storing the current result and reuse in future if similar input appears again.…”
Section: Prior Workmentioning
confidence: 99%
“…Literature 6 provides a clear distinction between them. But when we compare CR [7][8][9][10][11][12][13][14][15][16]30 and MZ, [17][18][19][20][21][22][23][24][25][26][27][28][29]35,36 we could not found sufficient difference among these two. Conceptually, CR refers to "reusing previously computed (buffered) results if same input appears again," and MZ refers to "storing the current result and reuse in future if similar input appears again.…”
Section: Prior Workmentioning
confidence: 99%
“…DTM is a reuse technique that operates on traces of instructions and is often implemented on top of Von Neumann‐based superscalar architectures, with further studies that include speculative execution . Speculative execution often improves the reuse rate of traces, because it enables reuse based on speculative values for input operands.…”
Section: Related Workmentioning
confidence: 99%
“…15 The size of each operation, ie, the reuse granularity, can vary from a single instruction 16 to groups of instructions, such as functions, 11 expressions, 17 basic blocks, 18 sub-blocks, 19 or traces. 20 DTM 10 is a reuse technique that operates on traces of instructions and is often implemented on top of Von Neumann-based superscalar architectures, [16][17][18][19][20][21][22][23][24] with further studies that include speculative execution. [25][26][27][28] Speculative execution often improves the reuse rate of traces, because it enables reuse based on speculative values for input operands.…”
Section: Related Workmentioning
confidence: 99%
“…22 Moreover, other studies have implemented similar memoization schemes into ARM-based superscalar processors. [23][24][25][26] Some works have also explored the reuse of computation in the GPU domain. 27 For instance, redundant fragment shader executions have been reused on a mobile GPU through hardware memoization.…”
Section: Related Workmentioning
confidence: 99%