2008 37th International Conference on Parallel Processing 2008
DOI: 10.1109/icpp.2008.65

Application of Automatic Parallelization to Modern Challenges of Scientific Computing Industries

Cited by 6 publications
(5 citation statements)
References 9 publications
“…Figure 13 shows our scalability speedups up to sixteen processors. Benchmark gamess, not measured, is notoriously difficult to parallelize [3] and we disambiguated only small-granularities loops that do not exhibit speedup, while applu exhibits two sequential loops that sum-up to 56% of sequential coverage and contain only sequential or small-granularity inner loops, which do not contribute to speedup. SPEC2000 We observe that speedups do not scale well between 8 to 16 processors; this is likely because the machine has eight dual-core processors, and executing on both cores decreases the per-core bandwidth.…”
Section: Perfect Club Suite Results (mentioning)
confidence: 99%
“…The first rule says that a repeated (irreducible) subtraction from a summary should be reorganized as one subtraction between that summary and the union of the subtracted terms. In Figure 8 we show intuitively that when A is included in neither B nor C, perhaps the union of B and C can simplify to a larger set in the array-abstraction domain which includes A, thus enabling the extraction of a more meaningful predicate.…”
Section: Enabling USR Transformation For Predicate Extraction (mentioning)
confidence: 99%
“…The strength of specialized kernel benchmarks is their ability to measure individual machine features by focusing on a particular system aspect. But observations of real application behavior are essential for assessing HPC technology [136], [137].…”
Section: Benchmarks Classification (mentioning)
confidence: 99%
“…The automatic parallelism has made great progress in numeric programs. But for general-purpose programs it can not exploit sufficient parallelism [7,8]. Since the irregular control flow and data flow in general-purpose programs make it hard for compiler to find enough threads which can execute in parallel without violating dependences.…”
Section: Introduction (mentioning)
confidence: 99%