2014 Second International Symposium on Computing and Networking 2014
DOI: 10.1109/candar.2014.14
|View full text |Cite
|
Sign up to set email alerts
|

A Time Optimal Parallel Algorithm for the Dynamic Programming on the Hierarchical Memory Machine

Abstract: The Hierarchical Memory Machine (HMM) is a theoretical parallel computing model that captures the essence of architecture of CUDA-enabled GPUs. The main contribution of this paper is to present an efficient implementation of the Ç´Ò ¿ µ-time dynamic programming algorithm for solving the optimal triangulation problem for a convex Ò-gon in the HMM. Although the HMM can run a lot of threads in parallel, it is very hard to accelerate computation involving complicated memory access such as the dynamic programming f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
4
0

Year Published

2018
2018
2018
2018

Publication Types

Select...
2
2

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 20 publications
0
4
0
Order By: Relevance
“…In this section, we show our previous GPU implementation of the optimal polygon triangulation on the GPU . This approach is used to efficiently compute the optimal polygon triangulation only for one instance using parallel threads on the GPU.…”
Section: Our Previous Gpu Implementation Of the Optimal Polygon Trianmentioning
confidence: 99%
See 1 more Smart Citation
“…In this section, we show our previous GPU implementation of the optimal polygon triangulation on the GPU . This approach is used to efficiently compute the optimal polygon triangulation only for one instance using parallel threads on the GPU.…”
Section: Our Previous Gpu Implementation Of the Optimal Polygon Trianmentioning
confidence: 99%
“…In this section, we show our previous GPU implementation of the optimal polygon triangulation on the GPU. 19,25,26 This approach is used to efficiently compute the optimal polygon triangulation only for one instance using parallel threads on the GPU. Therefore, to distinguish this approach from the proposed methods shown in the following, we call it intra-parallel thread assignment.…”
Section: Our Previous Gpu Implementation Of the Optimal Polygon Trianmentioning
confidence: 99%
“…It has been studied well to speed up DP programs using GPU (e.g. [2], [3]), where they mainly focus on optimizing the order of accessing data by proposing novel techniques avoiding memory access conflicts. In this study, we consider adopting a pipeline technique and implementing the DP program on GPU in a pipeline fashion.…”
Section: Introductionmentioning
confidence: 99%
“…It has been studied well to speed up DP programs using GPU (e.g. [1,2]), where they mainly focus on optimizing the order of accessing data by proposing novel techniques avoiding memory access conflicts. In our study, however, we consider adopting a pipeline technique and implementing the DP program on GPU in a pipeline fashion.…”
Section: Introductionmentioning
confidence: 99%