Systems in critical applications employ various hardware and software fault-tolerance techniques to ensure high dependability. Stochastic models are often used to analyze the dependability of these systems and assess the effectiveness of the fault-tolerance techniques employed. Measures like performance and performability of systems are also analyzed using stochastic models. These models take into account randomness in various events in the system (known as aleatory uncertainty) and are solved at fixed parameter values to obtain the measures of interest. However, in real life, the parameters of the stochastic models themselves are uncertain as they are derived from a finite (limited) number of observations or are simply based on expert opinions.Solving the stochastic models at fixed values of the model input parameters result in estimates of model output metrics which do not take into account the uncertainty in model input parameters (known as epistemic uncertainty).In this research work, we address the computation of uncertainty in output metrics of stochastic models due to epistemic uncertainty in model input parameters, with a focus on dependability and performance models of current computer and communication systems. We develop an approach for propagation of epistemic uncertainty in input parameters through stochastic dependability and performance models of varying complexity, to compute the uncertainty in the model output measures. and provides more robust estimates of uncertainty in the model output metrics than previous sampling based methods.We demonstrate the applicability of the uncertainty propagation approach by applying it to analytic stochastic dependability and performance models of computer systems, ranging from simple non-state-space models with a few input parameters to large state-space models and even hierarchical models with more than fifty input parameters. We further apply the uncertainty propagation approach to stochastic models with not only analytic or analytic-numeric solutions but also those with simulative solutions. We also consider a wide range of model output metrics including reliability and availability of computer systems, response time of a web service, capacity oriented availability of a communication system, security (probability of successful attack) of a network routing session, expected number of jobs in a queueing system with breakdown and repair of servers and call handoff probability of a cellular wireless communication cell.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.