2011 International Conference on Field-Programmable Technology 2011
DOI: 10.1109/fpt.2011.6132668
|View full text |Cite
|
Sign up to set email alerts
|

Cool Mega-Array: A highly energy efficient reconfigurable accelerator

Abstract: A highly energy efficient reconfigurable accelerator called CMA (Cool Mega-Array) is proposed. It consists of a large Processing Element (PE) array without memory elements for maintain result of ALU and configuration data, a small simple programmable micro controller for data management, and the data memory. Unlike traditional coarse grained reconfigurable processors, the power consumption for hardware context switching, storing intermediate data in registers, and clock distribution for them are eliminated fro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2011
2011
2022
2022

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 14 publications
(7 citation statements)
references
References 8 publications
0
7
0
Order By: Relevance
“…Compared with a 41.5-GOPS/ 0.775W (54.8-MOPS/mW) dynamically reconfigurable accelerator [7] and a 3.2-GOPS/50-mW VLIW accelerator (64-MOPS/mW) [4], the CMA-1 has better energy efficiency. Detail analysis of performance and energy consumption of CMA-1 is shown in [1].…”
Section: E Evaluation Summarymentioning
confidence: 99%
“…Compared with a 41.5-GOPS/ 0.775W (54.8-MOPS/mW) dynamically reconfigurable accelerator [7] and a 3.2-GOPS/50-mW VLIW accelerator (64-MOPS/mW) [4], the CMA-1 has better energy efficiency. Detail analysis of performance and energy consumption of CMA-1 is shown in [1].…”
Section: E Evaluation Summarymentioning
confidence: 99%
“…For example, in ADRES [2] the power overhead of the VLIW processor used to handle the data memory access is up to 20%. In CMA [39] the host CPU feeds the data into the PEs through a shared fetch register (FR) file. This is very inefficient in terms of flexibility.…”
Section: A Architecturementioning
confidence: 99%
“…Both of them employ multiple cores with the objective of distributing heterogeneous tasks to each core to save power, not to reach a significant amount of speedup with respect to single-core solutions. Past work on integration of low-power microcontrollers with accelerators has mainly concentrated on coupling with special-purpose computing devices such as specialized DSPs [23] [24], ASICs [5] [25], or reconfigurable computing fabrics [26]. Contrarily to the model we propose, none of these platforms can be considered fully programmable in a general-purpose sense, and no one supports a full offload of code from a host.…”
Section: Related Workmentioning
confidence: 99%