2008 International Workshop on Innovative Architecture for Future Generation High-Performance Processors and Systems 2008
DOI: 10.1109/iwia.2008.9
|View full text |Cite
|
Sign up to set email alerts
|

Design and Power Performance Evaluation of On-Chip Memory Processor with Arithmetic Accelerators

Abstract: In this paper, we design an on-chip memory processor with arithmetic accelerators, which are expected to improve power consumption. In addition, we evaluate the power performance of the processor. We propose implementing vector-type arithmetic accelerators and SIMD-type arithmetic accelerators in the on-chip memory processor. The evaluation results obtained using our simulator indicate that the performance of the 4 FMAs SIMD-type accelerators is similar to that of the 4 FMAs vector-type accelerators on DAXPY, … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
3
0

Year Published

2011
2011
2012
2012

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 13 publications
0
3
0
Order By: Relevance
“…With increasing memory walls, recent approaches have brought the computation units closer to memory, including hierarchical clustering of such combined tiles [66,41]. Despite such optimization, utilizations for GEMM range from 60% down to less than 40% with increasing numbers of tiles.…”
Section: Related Workmentioning
confidence: 99%
“…With increasing memory walls, recent approaches have brought the computation units closer to memory, including hierarchical clustering of such combined tiles [66,41]. Despite such optimization, utilizations for GEMM range from 60% down to less than 40% with increasing numbers of tiles.…”
Section: Related Workmentioning
confidence: 99%
“…With increasing memory walls, recent approaches have brought the computation units closer to memory, including hierarchical clustering of such combined tiles [38], [19]. Despite such optimization, utilizations for GEMM range from 60% down to less than 40% with increasing numbers of tiles.…”
Section: B Related Workmentioning
confidence: 99%
“…Systolic arrays were popularized in the 80s [19]. With increasing memory walls, recent approaches have brought the computation units closer to memory, including hierarchical clustering of such combined tiles [23,16]. Despite such optimization, utilizations for GEMM range from 60% down to less than 40% with increasing number of tiles.…”
Section: Related Workmentioning
confidence: 99%