2022
DOI: 10.1109/tc.2021.3104749
|View full text |Cite
|
Sign up to set email alerts
|

Energy-Efficient Stream Compaction Through Filtering and Coalescing Accesses in GPGPU Memory Partitions

Abstract: Graph-based applications are essential in emerging domains such as data analytics or machine learning. Data gathering in a knowledge-based society requires great data processing efficiency. High-throughput GPGPU architectures are key to enable efficient graph processing. Nonetheless, irregular and sparse memory access patterns present in graph-based applications induce high memory divergence and contention, which result in poor GPGPU efficiency for graph processing. Recent work has pointed out the importance o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 26 publications
0
4
0
Order By: Relevance
“…Graphicionado [23] is the first ASICbased one and can reduce random memory accesses. ISCU [49,50] designs a compacting and filtering technique to prepare data for SMs of GPU for higher GPU utilization. However, when serving the streaming graph processing on CPU, ISCU not only suffers from serious redundant computation overhead, but also needs to issue multiple accesses for the compacting/filtering of multiple data elements that reside in multiple cache lines.…”
Section: Additional Related Workmentioning
confidence: 99%
“…Graphicionado [23] is the first ASICbased one and can reduce random memory accesses. ISCU [49,50] designs a compacting and filtering technique to prepare data for SMs of GPU for higher GPU utilization. However, when serving the streaming graph processing on CPU, ISCU not only suffers from serious redundant computation overhead, but also needs to issue multiple accesses for the compacting/filtering of multiple data elements that reside in multiple cache lines.…”
Section: Additional Related Workmentioning
confidence: 99%
“…The efficiency of accessing memory is a deciding factor in improving performance due to the special multi-threaded execution mode of GPUs. Given that a large number of threads may issue memory access requests at the same time, those requests may be delivered to the off-chip DRAM if the storage hierarchy cannot effectively deal with said mode, leading to numerous threads being blocked and unable to secure the requested data [2,[12][13][14]. This will bring about painful repercussions.…”
Section: Introductionmentioning
confidence: 99%
“…In conclusion, the ISCU leverages the strengths of our previous work on improved graph processing by offloading stream compaction operations, and our work on improved irregular accesses on GPGPU architectures which deliver synergistic improvements in efficient graph processing. This work has been submitted for publication [138].…”
Section: Contributionmentioning
confidence: 99%
“…Our design achieves high energy savings and important speedups for graph processing in modern GPGPU architectures as explored in Chapter 6. This work has been submitted for publication [138].…”
Section: Graph Processing Algorithms On Gpgpu Architecturesmentioning
confidence: 99%