2017 IEEE Conference on Visual Analytics Science and Technology (VAST) 2017
DOI: 10.1109/vast.2017.8585646
|View full text |Cite
|
Sign up to set email alerts
|

A Visual Analytics System for Optimizing Communications in Massively Parallel Applications

Abstract: Current and future supercomputers have tens of thousands of compute nodes interconnected with high-dimensional networks and complex network topologies for improved performance. Application developers are required to write scalable parallel programs in order to achieve high throughput on these machines. Application performance is largely determined by efficient inter-process communication. A common way to analyze and optimize performance is through profiling parallel codes to identify communication bottlenecks.… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 16 publications
(5 citation statements)
references
References 59 publications
0
5
0
Order By: Relevance
“…( 2)) is a quadratic function, and the constraints are linear, the model can be classified as an Integer Quadratic Programming model. In fact, the problem presented is a typical Quadratic Assignment Problem (QAP [17,37]). QAP is considered to be NP-hard, however, considering the relatively small number of topics in the conversation history, this study employs the Gurobi solver to address the aforementioned problem.…”
Section: Global Viewmentioning
confidence: 99%
“…( 2)) is a quadratic function, and the constraints are linear, the model can be classified as an Integer Quadratic Programming model. In fact, the problem presented is a typical Quadratic Assignment Problem (QAP [17,37]). QAP is considered to be NP-hard, however, considering the relatively small number of topics in the conversation history, this study employs the Gurobi solver to address the aforementioned problem.…”
Section: Global Viewmentioning
confidence: 99%
“…Similarly, Bhatele et al [5] analyzed Dragonfly-based networks using a radial layout and a matrix view to show inter-group and intra-group links between the compute nodes. Fujiwara et al [17] utilized node-link diagrams and the matrix-based representations with hierarchical aggregation techniques to generalize to any network topology. Li et al [29] developed flexible visualization to analyze the network performance on the Dragonfly network by applying data aggregation techniques to scale for large scale networks.…”
Section: Performance Visualization Of Parallel Applicationsmentioning
confidence: 99%
“…Because streaming performance data is continuously changing with high-volume and high-variety properties, without any algorithmic and visual supports, conducting the above analysis is almost infeasible. While existing visual analytic systems [17,29] aim to support the analysis of dynamic performance data, support for real-time analysis is lacking.…”
Section: Introductionmentioning
confidence: 99%
“…Predicting user estimates of the runtime of their jobs so as to ensure that they not killed prematurely due to underestimation has also been studied from the point of view of noise and variation [23,66]. Furthermore, several machine learning techniques to predict application performance and detect anomalies for both power and network related variations have been explored [5,9,16,32,43,44,70,73], and the visualization community has come up with novel mechanisms to analyze network traffic and performance data [10,24,31,39,45,69]. However, all the studies listed above have focused on a single objective such as network traffic, or a single constraint such as power.…”
Section: Related Workmentioning
confidence: 99%
“…For our dataset, we varied core count (16,20,24 cores per node), task count (512, 1024, 2048, 4096 MPI ranks), power cap (64 W, 80 W, 115 W per socket), network QoS levels (combinations ranging from service level 0 to 2), and application placement algorithms (packed, spread and random assignment). All benchmarks are MPIbased, compiled with Intel compiler 16.0.3 and MVAPICH 2.2.…”
Section: Summary Of the Experiments Reportedmentioning
confidence: 99%