2001
DOI: 10.1007/3-540-44688-5_11
Using PRAM Algorithms on a Uniform-Memory-Access Shared-Memory Architecture

Abstract: The ability to provide uniform shared-memory access to a significant number of processors in a single SMP node brings us much closer to the ideal PRAM parallel computer. In this paper, we develop new techniques for designing a uniform shared-memory algorithm from a PRAM algorithm and present the results of an extensive experimental study demonstrating that the resulting programs scale nearly linearly across a significant range of processors (from 1 to 64) and across the entire range of instance sizes tested. …
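As a rough illustration of what such a PRAM-to-SMP mapping can look like in practice, the sketch below computes prefix sums by giving each thread one contiguous block of a shared array, so per-thread memory accesses stay contiguous and only one partial sum per thread crosses block boundaries. This is a hedged, generic OpenMP/C sketch, not code from the paper; the array name, input size, and the 256-thread cap are illustrative assumptions.

/* Hedged sketch, not the paper's code: a block-partitioned prefix sum on a
 * uniform-memory-access SMP. Each thread scans one contiguous block of the
 * shared array, then shifts its block by the sum of all earlier blocks, so
 * only p partial sums are exchanged between threads. */
#include <omp.h>
#include <stdio.h>

#define N 1000000              /* illustrative input size */

static double a[N];
static double block_sum[256];  /* one slot per thread; assumes <= 256 threads */

int main(void)
{
    for (long i = 0; i < N; i++)
        a[i] = 1.0;            /* prefix sums should come out 1, 2, 3, ... */

    #pragma omp parallel
    {
        int t = omp_get_thread_num();
        int p = omp_get_num_threads();
        long lo = (long)t * N / p, hi = (long)(t + 1) * N / p;

        double s = 0.0;        /* local inclusive scan of this thread's block */
        for (long i = lo; i < hi; i++) {
            s += a[i];
            a[i] = s;
        }
        block_sum[t] = s;
        #pragma omp barrier    /* wait until every block sum is published */

        double offset = 0.0;   /* exclusive sum of the blocks before this one */
        for (int j = 0; j < t; j++)
            offset += block_sum[j];
        for (long i = lo; i < hi; i++)
            a[i] += offset;    /* shift this block into its global position */
    }

    printf("a[N-1] = %.0f (expected %d)\n", a[N - 1], N);
    return 0;
}

The single barrier is the only synchronization point, and each thread touches roughly n/p contiguous elements plus p block sums; that per-thread cost shape is the kind of behaviour this style of SMP algorithm aims for, though the paper's own techniques and measurements are of course what the abstract refers to.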

Cited by 16 publications (7 citation statements). References 39 publications.
“…Fast parallel algorithms for irregular problems have been developed for such systems. For instance, we have designed fast parallel graph algorithms and demonstrated speedups compared with the best sequential implementation for problems such as ear decomposition [9], tree contraction and expression evaluation [10], spanning tree [6,8], rooted spanning tree [20], and minimum spanning forest [7]. Many of these algorithms achieve good speedups due to algorithmic techniques for efficient design and better cache performance.…”
Section: Introduction (mentioning)
confidence: 99%
“…SMP clusters are now ubiquitous in high-performance computing, consisting of clusters of multiprocessor nodes (e.g., IBM Regatta, Sun Fire, HP AlphaServer, and SGI Origin) interconnected with high-speed networks (e.g., vendor-supplied, or third party such as Myricom, Quadrics, and InfiniBand). Current research has shown that it is possible to design algorithms for irregular and discrete computations [1][2][3][4] that provide efficient and scalable performance on SMPs.…”
Section: Introduction (mentioning)
confidence: 99%
“…To analyze SMP performance, we use a complexity model similar to that of Helman and JáJá [20], which has been shown to provide a good cost model for shared-memory algorithms on current symmetric multiprocessors [19,20,2,3]. The model uses two parameters: the problem's input size n, and the number p of processors.…”
Section: Symmetric Multiprocessors (SMPs) (mentioning)
confidence: 99%
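For readers unfamiliar with that model, a common way to write the resulting cost in terms of those two parameters is sketched below in LaTeX. The component names T_M and T_C are our notation, and this is a hedged paraphrase of the usual presentation of the Helman-JáJá model, not a quotation from the cited statement:

% Hedged paraphrase of the Helman-JaJa style SMP cost model (notation ours)
T(n, p) \;=\; \big\langle\, T_M(n, p),\; T_C(n, p) \,\big\rangle

Here T_M(n, p) bounds the non-contiguous accesses to shared memory charged to any single processor on an input of size n, and T_C(n, p) bounds that processor's local computation; both components are functions of only n and p, the two parameters named in the statement above.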
“…Helman and JáJá [19,20] present an efficient list ranking algorithm with implementation on SMP servers that achieves significant parallel speedup. Using this implementation of list ranking, Bader et al. have designed fast parallel algorithms and demonstrated speedups compared with the best sequential implementation for graph-theoretic problems such as ear decomposition [2], tree contraction and expression evaluation [3], spanning tree [4], rooted spanning tree [13], and minimum spanning forest [5]. Many of these algorithms achieve good speedups due to algorithmic techniques for efficient design and better cache performance.…”
Section: Introduction (mentioning)
confidence: 99%
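To make the list-ranking primitive concrete, here is a minimal pointer-jumping (Wyllie-style) sketch in C with OpenMP. It is not the randomized Helman-JáJá algorithm cited above, only an illustration of the PRAM operation being parallelized; the list layout, length, and variable names are assumptions made for the example.

/* Hedged sketch: list ranking by pointer jumping (Wyllie's PRAM scheme), not
 * the randomized Helman-JaJa SMP algorithm referenced above. succ[i] is node
 * i's successor (the tail points to itself); rank[i] ends up as the distance
 * from node i to the tail. */
#include <omp.h>
#include <stdio.h>
#include <string.h>

#define N 16                   /* illustrative list length */

int main(void)
{
    int succ[N], rank[N], next_succ[N], next_rank[N];

    /* Build the list 0 -> 1 -> ... -> N-1; node N-1 is the tail. */
    for (int i = 0; i < N; i++) {
        succ[i] = (i == N - 1) ? i : i + 1;
        rank[i] = (i == N - 1) ? 0 : 1;
    }

    /* O(log N) pointer-jumping rounds: each node adds the rank of the node
     * it points to, then doubles the distance its pointer spans. Writing
     * into the next_* buffers stands in for the synchronous PRAM step. */
    for (int span = 1; span < N; span <<= 1) {
        #pragma omp parallel for schedule(static)
        for (int i = 0; i < N; i++) {
            next_rank[i] = rank[i] + rank[succ[i]];   /* the tail contributes 0 */
            next_succ[i] = succ[succ[i]];
        }
        memcpy(rank, next_rank, sizeof rank);
        memcpy(succ, next_succ, sizeof succ);
    }

    for (int i = 0; i < N; i++)
        printf("node %2d: rank %2d\n", i, rank[i]);   /* expect N-1-i */
    return 0;
}

Pointer jumping does O(n log n) total work, which is why practical SMP implementations such as the cited one use more work-efficient randomized schemes; the sketch is only meant to show the shared-memory access pattern that list ranking exercises.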