15th DASC. AIAA/IEEE Digital Avionics Systems Conference
DOI: 10.1109/dasc.1996.559128
|View full text |Cite
|
Sign up to set email alerts
|

A methodology for cost-effective software fault tolerance for mission-critical systems

Abstract: As computing capabilities continue to advance, there will be a concurrent rise in the number of both hardware and software faults. These will be caused by the greater volume of more complex software, by the increased number of untested software states, and by more incidents of hardware/software interaction faults as a result of increased hardware speed and density.The traditional software implemented fault tolerance approaches have been successfully utilized in life-critical systems, such as digital flight con… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(6 citation statements)
references
References 3 publications
0
6
0
Order By: Relevance
“…However their application in conjunction provides integrated comprehensive coverage, which dramatically increases the reliability of PANSAT. As mentioned at the beginning, most faulttolerance research has been working with distributed systems, working with a large pool of hardware resources [Kreutzfeld, 1997]. Unfortunately, the particulars of space, especially in this project, preclude the use of most of those features.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…However their application in conjunction provides integrated comprehensive coverage, which dramatically increases the reliability of PANSAT. As mentioned at the beginning, most faulttolerance research has been working with distributed systems, working with a large pool of hardware resources [Kreutzfeld, 1997]. Unfortunately, the particulars of space, especially in this project, preclude the use of most of those features.…”
Section: Discussionmentioning
confidence: 99%
“…If the same state is required to be restored multiple times, an alternate error handling routine is executed. The alternate method may be anything from running a substitute method, ignoring the process that generated the error, if possible, or requesting human intervention to correct a procedure [Kreutzfeld, 1997]. While there is enough slack in the system to perform the acceptance tests, there is neither enough memory nor extra processing time to keep track of multiple program states.…”
Section: Previous Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Developing highly reliable software is often an expensive undertaking due to the prolonged development time, attributable to increased design, review and testing activities [1]. Other trade-offs usually associated with the production of reliable software are functionality, performance, complexity, and safety [2].…”
Section: Introductionmentioning
confidence: 99%
“…For example, avionic applications may use techniques such as error or reasonability checks accompanied by simple recovery mechanisms such as replacement by last valid data [5]. More critical functions such as the system executive or common operating system functions should use techniques such as recovery blocks or N-version programming.…”
Section: 2-37mentioning
confidence: 99%