Self-Stabilizing Virtual Machine Hypervisor Architecture for Resilient Cloud

Binun, Alexander; Bloch, Mark; Kahil, Martin Ramzi; Menuhin, Boaz; Yagel, Reuven; Coupaye, Thierry; Lacoste, Marc; Wailly, Aurélien

doi:10.1109/services.2014.44

Cited by 9 publications

(3 citation statements)

References 18 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Binun et.al focus upon the resilience of the VMM [21]. They present a novel self-stabilising hypervisor for increased robustness against malicious faults.…”

Section: 22mentioning

confidence: 99%

On Resilience in Cloud Computing

Welsh

Benkhelifa

2020

ACM Comput. Surv.

View full text Add to dashboard Cite

Cloud infrastructures are highly favoured as a computing delivery model worldwide, creating a strong societal dependence. It is therefore vital to enhance their resilience, providing persistent service delivery under a variety of conditions. Cloud environments are highly complex and continuously evolving. Additionally, the plethora of use-cases ensures requirements for persistent service delivery vary. As a contribution to knowledge, this work surveys resilience techniques for cloud environments. We apply a novel perspective using a layered model of traditional and emerging cloud paradigms. Works are then classified according to the Resilinets model. For each layer, the most common techniques with limitations are derived including an actor’s strength in influencing resilience in the cloud with each technique. We conclude with some future challenges to the field of resilient cloud computing.

show abstract

“…Binun et.al focus upon the resilience of the VMM [21]. They present a novel self-stabilising hypervisor for increased robustness against malicious faults.…”

Section: 22mentioning

confidence: 99%

On Resilience in Cloud Computing

Welsh

Benkhelifa

2020

ACM Comput. Surv.

View full text Add to dashboard Cite

show abstract

“…Algorithms 1 and 2 provide a high-level description of our solution, and the details appear in algorithms 3, 4, 5 and 6, which implement the proposed solution to the above task specifications by considering the code to be executed by the Cloud, IoT devices, cloudlets, and respectively, the emulators of the replicated state-machine. Algorithm 3 assumes the availability of a self-stabilizing cloud infrastructure, such as [7].…”

Section: Proposed Solutionmentioning

confidence: 99%

A Self-stabilizing Control Plane for the Edge and Fog Ecosystems

Georgiou,

Pallis

et al. 2020

Preprint

View full text Add to dashboard Cite

Fog Computing is now emerging as the dominating paradigm bridging the compute and connectivity gap between sensing devices (a.k.a. "things") and latency-sensitive services. However, as fog deployments scale by accumulating numerous devices interconnected over highly dynamic and volatile network fabrics, the need for self-configuration and self-healing in the presence of failures is more evident now than ever. Using the prevailing methodology of self-stabilization, we propose a fault-tolerant framework for distributed control planes that enables fog services to cope and recover from a very broad fault model. Specifically, our model considers network uncertainties, packet drops, node fail-stop failures and violations of the assumptions according to which the system was designed to operate, such as an arbitrary corruption of the system state. Our self-stabilizing algorithms guarantee automatic recovery within a constant number of communication rounds without the need for external (human) intervention. To showcase the framework's effectiveness, the correctness proof of the proposed self-stabilizing algorithmic process is accompanied by a comprehensive evaluation featuring an open and reproducible testbed utilizing realworld data from the intelligent transportation domain. Results show that our framework ensures a fog ecosystem recovery from faults in constant time, analytics are computed correctly, while the overhead to the system's control plane scales linearly towards the IoT load.

show abstract

“…The latter research forms part of the focus of our research, where the intention is to shed light on some promising trends in cloud computing resilience with completely alternative architectures. This is further discussed in the next section Reactive error ranking and appropriate technique [11] Reactive introspection [12] Proactive high diversity for replica storage [13] Proactive memory stored backups Service Composition Graph based, interdepdency [15] Service Composition Agent-based [16] Low Level Diversity data-centre [17] Quality Adjustment brownout [18] Diversity Replicas Diversity Structure height Organisation VM Scheduling for resilience [20] Disaster Recovery Storage [21] Reactive reset upon fault Hypervisor [22] Proactive Diversity Hypervisor [23] QoR Evaluation Proactive service provisioning Diversity Geo-distribution [25] VM to PHY Mapping Design [26] VM to PHY Mapping Backup links [27] Redundancy VNet backup links III. ALTERNATIVE ARCHITECTURES FOR CLOUD COMPUTING RESILIENCE Some work will choose to encourage a conventionally different cloud architecture in order to provide increased resilience (Fig.…”

Section: State-of-the-art In Cloud Resiliencementioning

confidence: 99%

Perspectives on Resilience in Cloud Computing: Review and Trends

Welsh

Benkhelifa

2017

2017 IEEE/ACS 14th International Conference on Computer Systems and Applications (AICCSA)

View full text Add to dashboard Cite

Abstract-The development of resilient distributed systems is seen as essential to maintaining stable business and staterun processes due to information systems now underpinning most aspects of society. Cloud computing is now one of the most pervasive usage paradigms and due its novelty, research surrounding its resilience is largely lacking and often varied in terms of developed solutions. Therefore this paper provides an up-to-date review of resilience work in cloud computing. This includes methods of measuring and evaluating resilience, solutions for enabling resilience and alternative architectures developed with a focus upon ensuring resilience from the ground up. Firstly, resilience is defined within the context of cloud computing in order to categorise the work appropriately.

show abstract

Self-Stabilizing Virtual Machine Hypervisor Architecture for Resilient Cloud

Cited by 9 publications

References 18 publications

On Resilience in Cloud Computing

On Resilience in Cloud Computing

A Self-stabilizing Control Plane for the Edge and Fog Ecosystems

Perspectives on Resilience in Cloud Computing: Review and Trends

Contact Info

Product

Resources

About