Free the Conqueror! Refactoring divide-and-conquer functions

Kozsik, Tamás; Tóth, Melinda; Bozó, István

doi:10.1016/j.future.2017.05.011

Cited by 7 publications

(3 citation statements)

References 26 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Another line of research related to our work is the development of techniques and tools to identify computational patterns in sequential programs and refactor these codes in order to enable their effective parallelization. Many of these works consider explicitly or are specifically devoted to the D&C pattern targeted by our skeleton [14,21,30], further justifying the importance of this pattern. Coupling the analysis and transformations proposed by these works with efficient implementations like ours could enable the much desirable automatic optimized parallelization of D&C algorithms by advanced compilers.…”

Section: Related Workmentioning

confidence: 99%

A highly optimized skeleton for unbalanced and deep divide-and-conquer algorithms on multi-core clusters

2022

View full text Add to dashboard Cite

Efficiently implementing the divide-and-conquer pattern of parallelism in distributed memory systems is very relevant, given its ubiquity, and difficult, given its recursive nature and the need to exchange tasks and data among the processors. This task is noticeably further complicated in the presence of multi-core systems, where hybrid parallelism must be exploited to attain the best performance, and when unbalanced and deep workloads are considered, as additional measures must be taken to load balance and avoid deep recursion problems. In this manuscript a parallel skeleton that fulfills all these requirements while providing high levels of usability is presented. In fact, the evaluation shows that our proposal is on average 415.32% faster than MPI codes and 229.18% faster than MPI + OpenMP benchmarks, while offering an average improvement in the programmability metrics of 131.04% over MPI alternatives and 155.18% over MPI + OpenMP solutions.

show abstract

Section: Related Workmentioning

confidence: 99%

A highly optimized skeleton for unbalanced and deep divide-and-conquer algorithms on multi-core clusters

2022

View full text Add to dashboard Cite

show abstract

“…It is interesting to notice that while skeletons have been traditionally directly used by programmers, their scope of application is growing thanks to very promising novel research. Namely, the development of techniques and tools to identify computational patterns and refactor the codes containing them [19,20] not only simplifies the use of skeleton libraries by less experienced users, but it can even lead to the automatic parallelization of complex codes on top of libraries of skeletal operations.…”

Section: Related Workmentioning

confidence: 99%

A Parallel Skeleton for Divide-and-conquer Unbalanced and Deep Problems

Martínez

Fraguela

Dominguez

2021

Int J Parallel Prog

View full text Add to dashboard Cite

The Divide-and-conquer (D&C) pattern appears in a large number of problems and is highly suitable to exploit parallelism. This has led to much research on its easy and efficient application both in shared and distributed memory parallel systems. One of the most successful approaches explored in this area consists of expressing this pattern by means of parallel skeletons which automate and hide the complexity of the parallelization from the user while trying to provide good performance. In this paper, we tackle the development of a skeleton oriented to the efficient parallel resolution of D&C problems with a high degree of imbalance among the subproblems generated and/or a deep level of recurrence. The skeleton achieves in our experiments average speedups between 11% and 18% higher than those of other solutions, reaching a maximum speedup of 78% in some tests. Nevertheless, the new proposal requires an average of between 13% and 29% less programming effort than the usual alternatives.

show abstract

“…Discovering or developing proper means for creating efficient, scalable, and adaptable software for multicore and multimode computers is still an open and a very important problem. Great efforts are made to solve problems related to the efficiency of resource utilization, monitoring and failure handling, and, most importantly, development of highly concurrent systems …”

Section: Introductionmentioning

confidence: 99%