SWIFT: Software Implemented Fault Tolerance

Reis, George A.; Chang, Jonathan; Vachharajani, Neil; Rangan, Ram; August, David I.

doi:10.1109/cgo.2005.34

Cited by 568 publications

(427 citation statements)

References 29 publications

Supporting

Mentioning

424

Contrasting

Order By: Relevance

“…Software countermeasures can be designed at different levels, such as at an algorithmic level [6], in a high-level programming language [7][8][9] or at assembly level [10][11][12]. While higher level countermeasures may be optimized away or altered by a compiler, low-level countermeasures are compatible with existing compilers and toolchains.…”

Section: Many Countermeasuresmentioning

confidence: 99%

See 1 more Smart Citation

Efficient Design and Evaluation of Countermeasures against Fault Attacks Using Formal Verification

Goubet

Heydemann

Encrenaz

et al. 2016

Smart Card Research and Advanced Applications

View full text Add to dashboard Cite

Abstract. This paper presents a formal verification framework and tool that evaluates the robustness of software countermeasures against faultinjection attacks. By modeling reference assembly code and its protected variant as automata, the framework can generate a set of equations for an SMT solver, the solutions of which represent possible attack paths. Using the tool we developed, we evaluated the robustness of state-of-theart countermeasures against fault injection attacks. Based on insights gathered from this evaluation, we analyze any remaining weaknesses and propose applications of these countermeasures that are more robust.

show abstract

Section: Many Countermeasuresmentioning

confidence: 99%

“…Software countermeasures are often based on temporal redundancy (i.e. performing the same computation multiple times) to detect or tolerate errors during computations [10,12,8,11]. Control flow protection requires different mechanisms to detect a modification of the execution flow [7,14].…”

Section: Related Workmentioning

confidence: 99%

Efficient Design and Evaluation of Countermeasures against Fault Attacks Using Formal Verification

Goubet

Heydemann

Encrenaz

et al. 2016

Smart Card Research and Advanced Applications

View full text Add to dashboard Cite

show abstract

“…Extensions to the EDDI have been proposed [7] that achieve better efficiency by assuming reliable caches and memory, but still require redundant registers and instructions. Their experiments showed an average normalized execution time of 1.41, but without protection for system memory.…”

Section: Related Workmentioning

confidence: 99%

A Tunable, Software-Based DRAM Error Detection and Correction Library for HPC

Fiala

Ferreira

Mueller

et al. 2012

Euro-Par 2011: Parallel Processing Workshops

View full text Add to dashboard Cite

Abstract. Proposed exascale systems will present a number of considerable resiliency challenges. In particular, DRAM soft-errors, or bit-flips, are expected to greatly increase due to the increased memory density of these systems. Current hardware-based fault-tolerance methods will be unsuitable for addressing the expected soft error frequency rate. As a result, additional software will be needed to address this challenge. In this paper we introduce LIBSDC, a tunable, transparent silent data corruption detection and correction library for HPC applications. LIBSDC provides comprehensive SDC protection for program memory by implementing on-demand page integrity verification. Experimental benchmarks with Mantevo HPCCG show that once tuned, LIBSDC is able to achieve SDC protection with 50% overhead of resources, less than the 100% needed for double modular redundancy.

show abstract

“…Such an assumption could be reasonable if malicious failures are unlikely and each replica maintain a checksum of its history, reporting a failure when it detects that its history is compromised. Alternatively, automated approaches that transform hardware errors into crash failures could be used [5,6]. There is no way to prevent malicious replicas from issuing truncated histories.…”

Section: Safetymentioning

confidence: 99%

“…Various studies in complex systems have shown that crash failures constitute a minority of failures [2,3], while trends in hardware increase the probability of transient hardware errors such as bit flips [4][5][6]. Worse yet, most replication protocols deployed in cloud centers provide weak consistency guarantees, meaning that they introduce inconsistencies even if there are no faults [7,8].…”

Section: Introductionmentioning

confidence: 99%

Byzantine Chain Replication

Renesse

Schiper

2012

Lecture Notes in Computer Science

View full text Add to dashboard Cite

Abstract. We present a new class of Byzantine-tolerant State Machine Replication protocols for asynchronous environments that we term Byzantine Chain Replication. We demonstrate two implementations that present different trade-offs between performance and security, and compare these with related work. Leveraging an external reconfiguration service, these protocols are not based on Byzantine consensus, do not require majoritybased quorums during normal operation, and the set of replicas is easy to reconfigure. One of the implementations is instantiated with t + 1 replicas to tolerate t failures and is useful in situations where perimeter security makes malicious attacks unlikely. Applied to in-memory BerkeleyDB replication, it supports 20,000 transactions per second while a fully Byzantine implementation supports 12,000 transactions per second-about 70% of the throughput of a non-replicated database.

show abstract

SWIFT: Software Implemented Fault Tolerance

Cited by 568 publications

References 29 publications

Efficient Design and Evaluation of Countermeasures against Fault Attacks Using Formal Verification

Efficient Design and Evaluation of Countermeasures against Fault Attacks Using Formal Verification

A Tunable, Software-Based DRAM Error Detection and Correction Library for HPC

Byzantine Chain Replication

Contact Info

Product

Resources

About