A measurement-based model for estimation of resource exhaustion in operational software systems

作者: K. Vaidyanathan , K.S. Trivedi

DOI: 10.1109/ISSRE.1999.809313

关键词:

摘要: Software systems are known to suffer from outages due transient errors. Recently, the phenomenon of "software aging", in which state software system degrades with time, has been reported (S. Garg et al., 1998). The primary causes this degradation exhaustion operating resources, data corruption and numerical error accumulation. This may eventually lead performance or crash/hang failure, both. Earlier work area detect aging estimate its effect on resources did not take into account workload. In paper, we propose a measurement-based model rate both as function time workload state. A semi-Markov reward is constructed based resource usage collected UNIX system. We first identify different states using statistical cluster analysis build state-space model. Corresponding each resource, then defined for states. solved obtain trends estimated rates time-to-exhaustion resources. With help measure, proactive fault management techniques such rejuvenation" (Y. Huang 1995) be employed prevent unexpected outages.

参考文章(24)
Alessandro Zeigner, Giuseppe Serazzi, Domenico Ferrari, Measurement and tuning of computer systems ,(1983)
S. Garg, A. van Moorsel, K. Vaidyanathan, K.S. Trivedi, A methodology for detection and estimation of software aging international symposium on software reliability engineering. pp. 283- 292 ,(1998) , 10.1109/ISSRE.1998.730892
Pranab Kumar Sen, Estimates of the Regression Coefficient Based on Kendall's Tau Journal of the American Statistical Association. ,vol. 63, pp. 1379- 1389 ,(1968) , 10.1080/01621459.1968.10480934
J. Gray, D.P. Siewiorek, High-availability computer systems IEEE Computer. ,vol. 24, pp. 39- 48 ,(1991) , 10.1109/2.84898
R. Sahner, K.S. Trivedi, A. Puliafito, Performance And Reliability Analysis Of Computer Systems (an Example-based Approach Using The Sharpe Software IEEE Transactions on Reliability. ,vol. 46, pp. 441- 441 ,(1997) , 10.1109/TR.1997.664017
M.C. Hseuh, R.K. Iyer, K.S. Trivedi, Performance modeling based on real data: a case study IEEE Transactions on Computers. ,vol. 37, pp. 478- 484 ,(1988) , 10.1109/12.2195
Victor Nicola, Andrea Bobbio, Kishor Trivedi, A UNIFIED PERFORMANCE RELIABILITY ANALYSIS OF A SYSTEM WITH A CUMULATIVE DOWN TIME CONSTRAINT Microelectronics Reliability. ,vol. 32, pp. 49- 65 ,(1992) , 10.1016/0026-2714(92)90086-Z
R. Chillarege, S. Biyani, J. Rosenthal, Measurement of failure rate in widely distributed software ieee international symposium on fault tolerant computing. pp. 424- 433 ,(1995) , 10.1109/FTCS.1995.466957
Y. Huang, C. Kintala, N. Kolettis, N.D. Fulton, Software rejuvenation: analysis, module and applications ieee international symposium on fault tolerant computing. pp. 381- 390 ,(1995) , 10.1109/FTCS.1995.466961
R.K. Iyer, D.J. Rossetti, Effect of System Workload on Operating System Reliability: A Study on IBM 3081 IEEE Transactions on Software Engineering. ,vol. SE-11, pp. 1438- 1448 ,(1985) , 10.1109/TSE.1985.232180