1990
DOI: 10.1145/255129.255182

Performance evaluation and prediction for parallel algorithms on the BBN GP1000

Abstract: The technique of "load/store" memory reference modeling is based on deriving performance characteristics of the memory architecture of a computer by looking at the behavior of simple sequences of load, store and nop (null operation) instructions. The resulting data base can be used to match load/store templates against algorithm kernels to predict performance or as a source of data for testing analytical models of the architecture. In this paper we study the BBN GP1000 parallel processing system. We show how t…

Citations: cited by 15 publications (5 citation statements)
References: 1 publication
“…If we neglect the interaction between channels and assume that all internal buses are utilizable by all channels and PEs, then we have the following analytic model developed on top of the previous parallelizing theorem [2,12]: …”
Section: Discussion (mentioning)
confidence: 99%
“…Kisuki et al implement a compiler that traverses the optimization space for loop unrolling and tiling and runs all the produced code to choose the best version of a loop kernel [6]. Bodin et al propose an iterative compilation technique that balances code size and performance [19]. These approaches have large compile times because they search a prohibitively large optimization space and they involve running each version of the program in order to gauge its performance.…”
Section: The Promise Of Iterative Compilation (mentioning)
confidence: 99%
“…If we neglect the interaction between channels and assume that all internal buses are utilizable by all channels and PEs, we have the following analytic model developed on top of a previous parallelizing theorem [44,45]:…”
Section: Appendix 2 Analytical Models (mentioning)
confidence: 99%