2004
DOI: 10.1109/tsp.2004.828946
|View full text |Cite
|
Sign up to set email alerts
|

Dynamic Data Layouts for Cache-Conscious Implementation of a Class of Signal Transforms

Abstract: Abstract-Effective utilization of cache memories is a key factor in achieving high performance for computing large signal transforms. Nonunit stride access in the computation of large signal transforms results in poor cache performance, leading to severe degradation in the overall performance. In this paper, we develop a cache-conscious technique, called a dynamic data layout, to improve the performance of large signal transforms. In our approach, data reorganization is performed between computation stages to … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2014
2014
2016
2016

Publication Types

Select...
1
1
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 26 publications
0
1
0
Order By: Relevance
“…Transforming the memory layout at runtime has the latency log 10 (runtime) [ms] Memory layout transform performance using CPU (Xeon E3-1230), GPU (GTX-670) and HAMLeT (L4-B8-T256 and energy overhead of the roundtrip data movement [22], [4], [18]. Further, it is associated with a bookkeeping cost for updating the page tables [4], [18].…”
Section: Related Workmentioning
confidence: 99%
“…Transforming the memory layout at runtime has the latency log 10 (runtime) [ms] Memory layout transform performance using CPU (Xeon E3-1230), GPU (GTX-670) and HAMLeT (L4-B8-T256 and energy overhead of the roundtrip data movement [22], [4], [18]. Further, it is associated with a bookkeeping cost for updating the page tables [4], [18].…”
Section: Related Workmentioning
confidence: 99%