The RHODOS migration facility

Paoli, D. De; Gościński, Andrzej

doi:10.1016/s0164-1212(97)00014-9

Cited by 22 publications

(10 citation statements)

References 21 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…That is, there is no load-balancing policy; the user decides which processes to migrate and when. Examples include Accent [Zayas 1987], Locus [Thiel 1991], Utopia , DEMOS/MP [Powell and Miller 1983], V [Theimer et al 1985], NEST [Agrawal and Ezzet 1987], RHODOS [De Paoli and Goscinski 1995], and MIST [Casas et al 1995].…”

Section: Related Workmentioning

confidence: 99%

Exploiting process lifetime distributions for dynamic load balancing

Harchol-Balter

Downey

1997

ACM Trans. Comput. Syst.

317

110

View full text Add to dashboard Cite

We consider policies for CPU load balancing in networks of workstations. We address the question of whether preemptive migration (migrating active processes) is necessary, or whether remote execution (migrating processes only at the time of birth) is sufficient for load balancing. We show that resolving this issue is strongly tied to understanding the process lifetime distribution. Our measurements indicate that the distribution of lifetimes for a UNIX process is Pareto (heavy-tailed), with a consistent functional form over a variety of workloads. We show how to apply this distribution to derive a preemptive migration policy that requires no hand-tuned parameters. We used a trace-driven simulation to show that our preemptive migration strategy is far more effective than remote execution, even when the memory transfer cost is high.

show abstract

Section: Related Workmentioning

confidence: 99%

Exploiting process lifetime distributions for dynamic load balancing

Harchol-Balter

Downey

1997

ACM Trans. Comput. Syst.

317

110

View full text Add to dashboard Cite

show abstract

“…DNS policies, combined with a simple feedback alarm mechanism from highly utilized servers, effectively avoid Web-server system overload [19] [20]. The DNS, after receiving an address request, selects the least-loaded server.…”

Section: Server-state-based Algorithmsmentioning

confidence: 99%

Dispatcher Based Dynamic Load Balancing on Web Server System

Singh

Kumar

2012

International Journal of System Dynamics Applications

View full text Add to dashboard Cite

show abstract

“…In a study carried out by DePaoli and Goscinski [49], the migration of processes was improved by using copy-on-reference mechanisms. Incorporating copy-on-reference mechanisms into the migration of checkpoints made it possible to migrate only the minimum amount of data required to run a process on a different computer, while the majority of the process' volatile data (user stack and data heap) remained on the original computer.…”

Section: Future Workmentioning

confidence: 99%

A survey and review of the current state of rollback‐recovery for cluster systems

Maloney

Gościński

2009

Concurrency and Computation

Self Cite

View full text Add to dashboard Cite

SUMMARYA variety of research problems exist that require considerable time and computational resources to solve. Attempting to solve these problems produces long-running applications that require a reliable and trustworthy system upon which they can be executed. Cluster systems provide an excellent environment upon which to run these applications because of their low cost to performance ratio; however, due to being created using commodity components they are prone to failures. This report surveyed and reviewed the issues currently relating to providing fault tolerance for long-running applications. Several fault tolerance approaches were investigated; however, it was found that rollback-recovery provides a favourable approach for user applications in cluster systems. Two facilities are required to provide fault tolerance using rollback-recovery: checkpointing and recovery. It was shown here that a multitude of work has been done for enhancing checkpointing; however, the intricacies of providing recovery have been neglected. The problems associated with providing recovery include; providing transparent and autonomic recovery, selecting appropriate recovery computers, and maintaining a consistent observable behaviour when an application fails.

show abstract

The RHODOS migration facility

Cited by 22 publications

References 21 publications

Exploiting process lifetime distributions for dynamic load balancing

Exploiting process lifetime distributions for dynamic load balancing

Dispatcher Based Dynamic Load Balancing on Web Server System

A survey and review of the current state of rollback‐recovery for cluster systems

Contact Info

Product

Resources

About