Proceedings of the Twenty-Fourth Annual ACM Symposium on Parallelism in Algorithms and Architectures 2012
DOI: 10.1145/2312005.2312029
|View full text |Cite
|
Sign up to set email alerts
|

High-performance RMA-based broadcast on the intel SCC

Abstract: Many-core chips with more than 1000 cores are expected by the end of the decade. To overcome scalability issues related to cache coherence at such a scale, one of the main research directions is to leverage the message-passing programming model. The Intel Single-Chip Cloud Computer (SCC) is a prototype of a message-passing many-core chip. It offers the ability to move data between on-chip Message Passing Buffers (MPB) using Remote Memory Access (RMA). Performance of message-passing applications is directly aff… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2013
2013
2024
2024

Publication Types

Select...
4
1
1

Relationship

1
5

Authors

Journals

citations
Cited by 10 publications
(8 citation statements)
references
References 20 publications
0
8
0
Order By: Relevance
“…Intel SCC [17]. The main reason is that inter-core communication in these architectures involves some synchronization costs [21] which are not included in our model.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…Intel SCC [17]. The main reason is that inter-core communication in these architectures involves some synchronization costs [21] which are not included in our model.…”
Section: Discussionmentioning
confidence: 99%
“…A few recent studies have proposed performance models for other manycore architectures [21,24]. Our approach is similar to the one used in these papers.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…OC-BCast [17] is another efficient broadcasting algorithm; this algorithm is similar to ModMPB as it also has the receiving cores copy the message from the sending core's MPB to their own MPBs. However, there are a couple distinct differences between the two broadcasts.…”
Section: A Broadcasting On the Sccmentioning
confidence: 99%
“…Partitioned Global Address Space (PGAS), and the wider class of Remote Memory Access (RMA) programming models enable high-performance communications that often outperform Message Passing [19,34]. RMA utilizes remote direct memory access (RDMA) hardware features to access Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page.…”
Section: Introductionmentioning
confidence: 99%