2015 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS) 2015
DOI: 10.1109/samos.2015.7363689
|View full text |Cite
|
Sign up to set email alerts
|

Power optimizations for transport triggered SIMD processors

Abstract: Power consumption in modern processor design is a key aspect. Optimizing the processor for power leads to direct savings in battery energy consumption in case of mobile devices. At the same time, many mobile applications demand high computational performance. In case of large scale computing, low power compute devices help in thermal design and in reducing the electricity bill. This paper presents a case study of a customized low power vector processor design that was synthesized on a 28 nm process technology.… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2019
2019
2020
2020

Publication Types

Select...
2

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 10 publications
0
2
0
Order By: Relevance
“…As a drawback, the TTA programming model requires large instruction widths of up to 112 bit, increasing the silicon area of the instruction memory and the power consumption. However, mechanisms like instruction register files, variable length encoding, and loop buffers [23,24,33] could be applied in future work to mitigate the effects on area and power consumption. Compared to the ARM Cortex-M0, the transporttriggered architecture configuration C12 has an 8.59 higher performance and 2.49 higher computational energy efficiency at a 1.69 larger total silicon area, showing the considerable range of design trade-offs for different architectures.…”
Section: Discussionmentioning
confidence: 99%
“…As a drawback, the TTA programming model requires large instruction widths of up to 112 bit, increasing the silicon area of the instruction memory and the power consumption. However, mechanisms like instruction register files, variable length encoding, and loop buffers [23,24,33] could be applied in future work to mitigate the effects on area and power consumption. Compared to the ARM Cortex-M0, the transporttriggered architecture configuration C12 has an 8.59 higher performance and 2.49 higher computational energy efficiency at a 1.69 larger total silicon area, showing the considerable range of design trade-offs for different architectures.…”
Section: Discussionmentioning
confidence: 99%
“…Table II shows the estimated power and energy efficiency of the quadcore system. In general, 22.0% of the power is consumed by vector execution units, 8.3% by the vector register file, 2.1% by the scalar ALUs and scalar registers, 8.5% by the interconnect network, 26% by the global memory, 16.9% by The effect on the TTA-specific optimizations on the power consumption was analyzed in preliminary work in [50]. The TTA-specific optimizations provided on average 18% decrease in power consumption in these workloads.…”
Section: Discussionmentioning
confidence: 99%