Analysis of the mean time to data loss of nested disk arrays RAID-01 on basis of a specialized mathematical model P A Rahman -The reliability model of the fault-tolerant computing system with triple-modular redundancy based on the independent nodes P A Abstract. This paper deals with the fault-tolerant data processing systems, which are widely used in modern world of information technologies and have acceptable overhead expenses in hardware implementation. A simplified reliability model for duplex systems and the offered by authors advanced model for data processing systems with primary and backup nodes based on a three-state model of recoverable elements, which takes into consideration different failure rates of passive and active nodes and finite time of node activation, are also given. A calculation formula for the availability factor of the dual-node data processing system with primary and backup nodes and calculation examples are also provided.
IntroductionIn modern world a rapid development of information technologies and their implementation in different spheres of human activity is observed. Almost every day a person has to deal with information. He creates, stores, processes and transmits it using computers and mobile devices. Medium and large-scale enterprises use specialized data storage and processing systems. On their basis a set of information systems operate and assist the business processes of an enterprise. Data processing systems are widely used in modern enterprises, especially high-availability clusters for database systems, which provide fault-tolerant data processing and storage. A cluster is a set of computers linked by a high-speed communication network and logically united by special software for distributed data processing. In practice the dual-node high-availability clusters with shared storage are used because of the acceptable reliability / cost ratio. To avoid the database access conflicts at any time only one node is active and it processes user requests, the other one is passive in a standby mode. For such systems, it is important to know their reliability for estimation of risks to the business processes. In this situation, the development of reliability models and the analysis of reliability indexes for fault-tolerant data processing systems is quite an important task.What concerns the reliability models, on the one hand, there are a number of academic books on the reliability theory [1,2], in which the generalized reliability models of technical systems are discussed, but there are no specific examples related to modern data processing systems, including the high-availability clusters. On the other hand, a number of specialized books [3,4], dedicated to reliability of computing systems and networks, discuss data processing systems, but the given reliability models for duplex systems are too simplified and provide overestimated values for reliability indexes.