A blocked all-pairs shortest-paths algorithm

Venkataraman, Gayathri; Sahni, Sartaj; Mukhopadhyaya, Srabani

doi:10.1145/996546.996553

Cited by 63 publications

(39 citation statements)

References 16 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…Since,OpenCL does not support recursion so implementation of recursive function is done in host program which calls OpenCL kernel recursivelyThis Kleene's based parallel recursive algorithm shows a significant speedup over OpenCL parallel Floyd Warshall's algorithm over same GPU. [8] It is a blocked organization of Floyd Warshall's all pairs shortest paths algorithm to make better utilization of cache.Several models for computer with different organization of memory has been developed although L2 cache is architecture dependent.La Marca and Ladner develop a model for single level direct mapped cache.They used this model to analyze the performance of binary heaps and cache aligned dheaps and optimized the cache performance for several sorting methods.Authors obtained a lower bound for the L1 and L2 cache miss rate by determining the minimum number of cache misses and making the reasonable assumption that cache optimization will not decrease the total memory references.…”

Section: International Journal Of Computer Applications (0975 -8887) mentioning

confidence: 99%

Algorithms of All Pair Shortest Path Problem

Susmita¹,

Pandey²

2015

IJCA

View full text Add to dashboard Cite

This paper is based on survey of various algorithms for all pair shortest path problem (APSP) on arbitrary real weighted directed graphs.This paper has summarized existing methods for solving shortest-path problems. In particular, we have addressed both sequential and parallel algorithms. We begin with a review of conventional sequential shortest-path algorithms and later, we have discussed blocked and vectorized implementation, thereby with the aim of reducing computational effort.

show abstract

Section: International Journal Of Computer Applications (0975 -8887) mentioning

confidence: 99%

Algorithms of All Pair Shortest Path Problem

Susmita¹,

Pandey²

2015

IJCA

View full text Add to dashboard Cite

show abstract

“…Our work is similar to the work by Venkataraman et. al [17] but unlike their work we have proposed OpenCL based implementation involving high level of parallelism, data reuse that fully exploits architectural benefits of GPU as a low cost computational resource.…”

Section: Problem Time Complexitymentioning

confidence: 99%

OpenCL Parallel Blocked Approach for Solving All Pairs Shortest Path Problem on GPU

Pandey¹,

Sharma²

2015

IJCA

View full text Add to dashboard Cite

All-Pairs Shortest Path Problem (APSP) finds a large number of practical applications in real world. This paper presents a blocked parallel approach for APSP using an open standard framework OpenCL, which provides development environment for utilizing heterogeneous computing elements of computer system and to take advantage of massive parallel capabilities of multi-core processors such as graphics processing unit (GPU) and CPU. This blocked parallel approach exploits the local shared memory of GPU, thereby enhancing the overall performance. The proposed solution is for directed and dense graphs with no negative cycles and is based on blocked Floyd Warshall (FW) and Kleene"s algorithm. Like Floyd Warshall this approach is also in-place and therefore requires no extra memory.

show abstract

“…Unlike BFS, Floyd-Warshall's algorithm (FW) [18], [19] has O(V 3 ) time complexity, which is irrelevant to the graph sparsity. Blocked FW algorithm [20] is an improved version of FW algorithm. Not only is it more efficient than the basic FW algorithm, it is also more suitable for GPU implementation.…”

Section: E All-pair Shortest Pathsmentioning

confidence: 99%

“…The whole adjacency matrix is first converted to the cost matrix C, where each element C ij represents the path-length of a voxel-pair (i, j). Then the cost matrix is divided into r sub-blocks of equal size [20]. The outer loop iterates over the r primary blocks (the blocks along the diagonal of the matrix).…”

Section: ) Bfs On Cpumentioning

confidence: 99%

A heterogeneous accelerator platform for multi-subject voxel-based brain network analysis

Wang¹,

Xu²,

Ren$^{³

et al. 2011

2011 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)

View full text Add to dashboard Cite

Abstract-The research on understanding the human brain has attracted more and more attention. A promising method is to model the brain as a network based on modern imaging technologies and then to apply graph theory algorithms for analysis. In this work, we examine the computing bottleneck of this method, and propose a CPU-GPU heterogeneous platform to accelerate the process. We construct a statistical brain network from a sample of 198 people and get characteristics such as nodal degree and modularity. This is the first study of voxelbased brain networks on large samples. We also illustrate that domain-specific hardware platform can have a significant impact on neuroscience studies.

show abstract

A blocked all-pairs shortest-paths algorithm

Cited by 63 publications

References 16 publications

Algorithms of All Pair Shortest Path Problem

Algorithms of All Pair Shortest Path Problem

OpenCL Parallel Blocked Approach for Solving All Pairs Shortest Path Problem on GPU

A heterogeneous accelerator platform for multi-subject voxel-based brain network analysis

Contact Info

Product

Resources

About