2012
DOI: 10.1002/cpe.1914
|View full text |Cite
|
Sign up to set email alerts
|

UPCBLAS: a library for parallel matrix computations in Unified Parallel C

Abstract: The popularity of Partitioned Global Address Space (PGAS) languages has increased during the last years thanks to their high programmability and performance through an efficient exploitation of data locality, especially on hierarchical architectures such as multicore clusters. This paper describes UP-CBLAS, a parallel numerical library for dense matrix computations using the PGAS Unified Parallel C (UPC) language. The routines developed in UPCBLAS are built on top of sequential BLAS functions and exploit the p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
4
0

Year Published

2012
2012
2018
2018

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 8 publications
(4 citation statements)
references
References 25 publications
0
4
0
Order By: Relevance
“…PGAS languages (such as UPC, Co-Array Fortran [27] or Titanium [28]) are often easier to use than their message passing counterparts [29,30] and can also obtain better performance by using efficient one-sided communication [31][32][33]. UPC++ combines these advantages of the PGAS model with object oriented programming.…”
Section: A Balanced On-demand Distribution Of the Reads Basedmentioning
confidence: 99%
“…PGAS languages (such as UPC, Co-Array Fortran [27] or Titanium [28]) are often easier to use than their message passing counterparts [29,30] and can also obtain better performance by using efficient one-sided communication [31][32][33]. UPC++ combines these advantages of the PGAS model with object oriented programming.…”
Section: A Balanced On-demand Distribution Of the Reads Basedmentioning
confidence: 99%
“…One or more candidate vectors will be affine to each UPC thread (see fig. 4 for illustration of the memory layout); 2 Evaluate an objective function ranking the vectors in the population in parallel; 3 while Termination criteria not satisfied do 4 Each thread T in parallel 5 for i = {1, . .…”
Section: Differential Evolution In Upcmentioning
confidence: 99%
“…On the other hand, wrong communication patterns in which processes unnecessarily access non-local regions of shared memory can be obtained easily due to the simplicity of the model. The PGAS is a popular model for high performance computing (HPC) [3] implemented by a number of programming languages. Most used PGAS languages include Coarray Fortran and Unified Parallel C (UPC) [2], [4], [5].…”
Section: Introductionmentioning
confidence: 99%
“…Our parallel implementation overcomes the scalability issues of pMap thanks to an efficient use of UPC++ [ 18 ], an extension of C++ for parallel computing which has evolved from Unified Parallel C (UPC) [ 19 ]. PGAS (Partitioned Global Address Space) languages (such as UPC, Co-Array Fortran [ 20 ] or Titanium [ 21 ]) are often easier to use than message passing counterparts [ 22 , 23 ] and can also obtain better performance than them thanks to efficient one-sided communication [ 24 – 26 ]. UPC++ combines these advantages of the PGAS model and object oriented programming.…”
Section: Introductionmentioning
confidence: 99%