2017
DOI: 10.3390/jlpea7010005
|View full text |Cite
|
Sign up to set email alerts
|

Energy Efficiency Effects of Vectorization in Data Reuse Transformations for Many-Core Processors—A Case Study †

Abstract: Abstract:Thread-level and data-level parallel architectures have become the design of choice in many of today's energy-efficient computing systems. However, these architectures put substantially higher requirements on the memory subsystem than scalar architectures, making memory latency and bandwidth critical in their overall efficiency. Data reuse exploration aims at reducing the pressure on the memory subsystem by exploiting the temporal locality in data accesses. In this paper, we investigate the effects on… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(1 citation statement)
references
References 28 publications
(40 reference statements)
0
1
0
Order By: Relevance
“…Al Hasib et al investigated the effects on performance and energy using a data reuse methodology combined with parallelization and vectorization in multi-and many-core processors. In this reference, a full-search motion estimation kernel was evaluated on an Intel ® (Santa Clara, CA, USA) Xeon Phi TM many-core processor with SSE and the AVX instruction set; the test results revealed that data-level parallel architectures was the design choice in many energy-efficient computing systems [18]. The instruction set can be used to process massive data hashing, comparison, and sorting operations so that the CPU as the effective computing power for flow heat ranking becomes possible.…”
Section: Introductionmentioning
confidence: 99%
“…Al Hasib et al investigated the effects on performance and energy using a data reuse methodology combined with parallelization and vectorization in multi-and many-core processors. In this reference, a full-search motion estimation kernel was evaluated on an Intel ® (Santa Clara, CA, USA) Xeon Phi TM many-core processor with SSE and the AVX instruction set; the test results revealed that data-level parallel architectures was the design choice in many energy-efficient computing systems [18]. The instruction set can be used to process massive data hashing, comparison, and sorting operations so that the CPU as the effective computing power for flow heat ranking becomes possible.…”
Section: Introductionmentioning
confidence: 99%