2012
DOI: 10.1587/elex.9.1147

Control-enhanced power-SIMD

Cited by 2 publications (5 citation statements)
References 5 publications
“…Area consumption is measured using the RTL Verilog model of the prototype YHFT Matrix DSP [6] and synthesized using typical operation conditions in TSMC 45 nm technology target at 1 GHz. The total area of DSP is 12 mm Table I, normalized with respect to the "Original" version.…”
Section: Methods (mentioning)
confidence: 99%
“…2(a). We have introduced the first two parts in the previous research [6]. In this paper we emphasize the third part, DBC, a dedicated hardware structure for pipelined inter-thread communication.…”
Section: Hardware and Compiler Support (mentioning)
confidence: 99%
“…To verify the hardware cost of DBTC mode architecture, we add four major parts to the hardware implementation of the prototype YHFT-matrix processor [11] in Verilog HDL. The prototype contains a scalar control unit, 16 VPEs each with 4 function units, 1MB VM, and 16KB of Instruction cache.…”
Section: Hardware Overhead (mentioning)
confidence: 99%
“…We have implemented a selection of representative irregular applications to illustrate the key feature of the new architecture, including examples from the EEMBC benchmark suite and applications from the H.264/AVC high definition real-time video compression, as well as two implementations of applications in computer arithmetic that exhibit highly irregular control flow. Table 4 shows the mapping results and the description of the representative applications in DBTC mode. We model our proposed hardware changes using a cycle-accurate modified version of FT-Matrix-sim [11], and manually optimized assembly code is used as the input of the simulator.…”
Section: Performance Evaluation (mentioning)
confidence: 99%