High-speed, wide area, data intensive computing: a ten year retrospective

Johnston, W.E.

doi:10.1109/hpdc.1998.709982

Cited by 14 publications

(7 citation statements)

References 10 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…A tree topology platform: two star-configured racks connected via the backbone cable. The dashed arrows denote one example application with five logical communication links: e a 1 ;b 1 -e a 5 ;b 5 . The processes on each logical link are not explicitly labeled for clarity in the graph.…”

Section: Contributionsmentioning

confidence: 99%

“…1576 J. ZHU ET AL.archy and resource sharing, make communication time prediction non-trivial and challenging for high-performance clusters.On the other hand, such predictions are needed more now than ever because of the increasing importance of data-intensive applications [4,5] that devote a significant amount of their total execution time in parallel processing to I/O or network communication, instead of computation. However, making such predictions accurately is nontrivial when contention exists on different components in hierarchical networks.…”

mentioning

confidence: 99%

“…On the other hand, such predictions are needed more now than ever because of the increasing importance of data-intensive applications [4,5] that devote a significant amount of their total execution time in parallel processing to I/O or network communication, instead of computation. A good usable performance analysis of such data-intensive applications requires that the communication model reflects the network properties accurately on state-of-the-art network topologies and technologies.…”

mentioning

confidence: 99%

“…. 1576 J. ZHU ET AL.archy and resource sharing, make communication time prediction non-trivial and challenging for high-performance clusters.On the other hand, such predictions are needed more now than ever because of the increasing importance of data-intensive applications [4,5] that devote a significant amount of their total execution time in parallel processing to I/O or network communication, instead of computation. A good usable performance analysis of such data-intensive applications requires that the communication model reflects the network properties accurately on state-of-the-art network topologies and technologies.In this article, we consider Ethernet-based network because, compared with custom interconnects (e.g., InfiniBand and Myrinet), it offers widespread compatibility, better cost-performance tradeoff, and a superior road map to 100-Gb standard [6,7].…”

mentioning

confidence: 99%

See 3 more Smart Citations

Asymmetric communication models for resource‐constrained hierarchical ethernet networks

Zhu

Lastovetsky

Ali

et al. 2014

Concurrency and Computation

View full text Add to dashboard Cite

Communication time prediction is critical for parallel application performance tuning, especially for the rapidly growing field of data-intensive applications. However, making such predictions accurately is nontrivial when contention exists on different components in hierarchical networks. In this article, we derive an 'asymmetric network property' on transmission control protocol (TCP) layer for concurrent bidirectional communications in a commercial off-the-shelf (COTS) cluster and develop a communication model as the first effort to characterize the communication times on hierarchical Ethernet networks with contentions on both network interface card and backbone cable levels. We develop a micro-benchmark for a set of simultaneous point-to-point message-passing interface (MPI) operations on a parametrized network topology and use it to validate our model extensively and show that the model can be used to predict the communication times for simultaneous MPI operations (both point-to-point and collective communications) on resourceconstrained networks effectively. We show that if the asymmetric network property is excluded from the model, the communication time predictions will be significantly less accurate than those made by using the asymmetric network property. In addition, we validate the model on a cluster of Grid5000 infrastructure, which is a more loosely coupled platform. As such, we advocate the potential to integrate this model in performance analysis for data-intensive parallel applications. Our observation of the performance degradation caused by the asymmetric network property suggests that some part of the software stack below TCP layer in COTS clusters needs targeted tuning, which has not yet attracted any attention in literature. . 1576 J. ZHU ET AL.archy and resource sharing, make communication time prediction non-trivial and challenging for high-performance clusters.On the other hand, such predictions are needed more now than ever because of the increasing importance of data-intensive applications [4,5] that devote a significant amount of their total execution time in parallel processing to I/O or network communication, instead of computation. A good usable performance analysis of such data-intensive applications requires that the communication model reflects the network properties accurately on state-of-the-art network topologies and technologies.In this article, we consider Ethernet-based network because, compared with custom interconnects (e.g., InfiniBand and Myrinet), it offers widespread compatibility, better cost-performance tradeoff, and a superior road map to 100-Gb standard [6,7]. As of June 2011, 1 or 10 Gb Ethernet has been used as the communication infrastructure in over 45% of the top 500 supercomputers [8]. We use message-passing interface (MPI) as the programming model, which has become the de facto standard for application layer communication on distributed memory systems. On the basis of transmission control protocol (TCP) messaging protocol, MPI over 1 Gb Ethernet has shown c...

show abstract

Section: Contributionsmentioning

confidence: 99%

mentioning

confidence: 99%

mentioning

confidence: 99%

mentioning

confidence: 99%

See 2 more Smart Citations

Asymmetric communication models for resource‐constrained hierarchical ethernet networks

Zhu

Lastovetsky

Ali

et al. 2014

Concurrency and Computation

View full text Add to dashboard Cite

show abstract

“…In the spring of 1989, the concept, data intensive computing, was originated from a demonstration that would relate remote visualization and networking to prove the impact of high-speed networks by Craig Fields [3]. Purely data intensive applications process multi-terabyte to petabytes-sized datasets which commonly comes in several different formats and is often distributed across multiple locations [4].…”

Section: Introductionmentioning

confidence: 99%

Large-scale Real-time Data-driven Scientific Applications

Cao

2011

2011 Second International Conference on Networking and Distributed Computing

View full text Add to dashboard Cite

Abstract-Large-scale real-time data processing is becoming common in many scientific disciplines. But processing large amount of data in real-time is still challenging with existing technology. In last few years, the dynamic data driven approach is becoming people's spotlight due to its potential in reducing data intelligently. Enlighten by this concept, a new data-driven framework for large-scale real-time data analysis is proposed in this work and a scientific application under this framework is given in details. By introducing additional information to data analysis processes, large-scale data processing can be achieved with real-time time constraint.

show abstract

Data-Intensive Technologies for Cloud Computing

Middleton¹

2010

Handbook of Cloud Computing

View full text Add to dashboard Cite

High-speed, wide area, data intensive computing: a ten year retrospective

Abstract: Modern scientific computing involves organizing, moving, visualizing, and

Cited by 14 publications

References 10 publications

Asymmetric communication models for resource‐constrained hierarchical ethernet networks

Asymmetric communication models for resource‐constrained hierarchical ethernet networks

Large-scale Real-time Data-driven Scientific Applications

Data-Intensive Technologies for Cloud Computing

Contact Info

Product

Resources

About