Adaptive Fault-Resistant Systems

Authors: Jack Goldberg, Kane Kim, Ira Greenberg, E. D. Jensen, Raymond Clark

DOI:

Keywords: Adaptive systems, Recovery blocks, Fault tolerance, Anomaly management

Abstract: The combined effects of faults and resource failures, wide swings in service demand, and situation-dependent user requirements stress a computer's ability to satisfy its expectations. This is an especially significant problem for distributed systems that employ unreliable communications and whose components may operate in different, perhaps harsh, physical and data usage environments. One of the goals for adaptive design is to allow flexible use of available resources to cover a much wider range of kinds of environmental variables than could be covered by a fixed, worst-case design. The research focused on the tasks of: (1) developing a theory of fault-resistant systems and general principles of architectural design; (2) developing specific techniques; and (3) demonstrating designs. Three mechanisms were investigated: Adaptive Distributed Recovery Blocks (ADRBs), a multiple-mode scheme for error detection and recovery, useful for both hardware and software faults; fault tolerance for hybrid faults, an economical technique for tolerating faults of simple and complex types; and thread integrity, a technique for detecting and repairing breaks in threads in operating environments using the Alpha programming model.
