2012
DOI: 10.1109/tc.2012.132
|View full text |Cite
|
Sign up to set email alerts
|

Codesign Tradeoffs for High-Performance, Low-Power Linear Algebra Architectures

Abstract: As technology is reaching physical limits, reducing power consumption is the key issue on our path to sustained performance. In this paper, we study fundamental tradeoffs and limits in efficiency (as measured in energy per operation) that can be achieved for an important class of kernels, namely the level-3 Basic Linear Algebra Sub-rountines (BLAS). It is well-accepted that specialization is the key to efficiency. This paper establishes a baseline by studying general matrix-matrix multiplication (GEMM) on a va… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
26
0

Year Published

2012
2012
2023
2023

Publication Types

Select...
5
4
1

Relationship

2
8

Authors

Journals

citations
Cited by 51 publications
(26 citation statements)
references
References 64 publications
0
26
0
Order By: Relevance
“…This question is at least partially answered in [Pedram et al 2012b;Pedram et al 2012a], which examines how to design specialized hardware (both compute core and entire processor) for linear algebra computation. The models used for such purpose have much in common with our model for determining the parameter values for the micro-kernel.…”
Section: Discussionmentioning
confidence: 99%
“…This question is at least partially answered in [Pedram et al 2012b;Pedram et al 2012a], which examines how to design specialized hardware (both compute core and entire processor) for linear algebra computation. The models used for such purpose have much in common with our model for determining the parameter values for the micro-kernel.…”
Section: Discussionmentioning
confidence: 99%
“…Local storage in each PE consists of a bigger single-ported and a smaller dual-ported memory. An extensive study of memory size trade-offs for the core was presented in our previous work [35]. Typically in dense linear algebra problems, access patterns are predictable and in most cases sequential, and there is no need for complex caching schemes.…”
Section: A Lac Architecturementioning
confidence: 99%
“…Our starting point is a Linear Algebra Core (LAC) that we developed in previous work [5]. The core design and its efficiency were originally derived for GEMM operations.…”
Section: Introductionmentioning
confidence: 99%