Distributed Recovery in Applicative Systems

作者: Frank C. H. Lin , Robert M. Keller

DOI:

关键词:

摘要: Applicative systems are promising candidates for achieving high performance computing through aggregation of processors. This paper studies the fault recovery problems in a class applicative systems. The concept functional checkpointing is proposed as nucleus distributed mechanism. entails incrementally building resilient structure evaluation an program proceeds. A simple rollback algorithm suggested to regenerate corrupted by redoing most effective checkpoints. Another algorithm, which attempts recover intermediate results, also presented. parent faulty task reproduces twin failed task. regen~rated inherits all offspring so that partial results can be salvaged.

参考文章(13)
Vason P. Srini, Node Reassignment in a Dataflow System. international conference on distributed computing systems. pp. 15- 27 ,(1984)
Frank C. H. Lin, Robert M. Keller, Gradient Model: A Demand-Driven Load Balancing Scheme. international conference on distributed computing systems. pp. 329- 336 ,(1986)
GARY LINDSTROM, SUHAS PATIL, ROBERT M. KELLER, A loosely-coupled applicative multi-processing system * 1979 International Workshop on Managing Requirements Knowledge (MARK). ,(1979) , 10.1109/MARK.1979.8817294
H. Kopetz, A. Avizienis, T. Anderson, J. C. Laprie, P. A. Lee, Fault tolerance, principles and practice ,(1990)
Keller, Lin, Simulated Performance of a Reduction-Based Multiprocessor IEEE Computer. ,vol. 17, pp. 70- 82 ,(1984) , 10.1109/MC.1984.1659188
M.J. Fischer, N.D. Griffeth, N.A. Lynch, Global States of a Distributed System IEEE Transactions on Software Engineering. ,vol. 8, pp. 198- 202 ,(1982) , 10.1109/TSE.1982.235418
D.J. Taylor, D.E. Morgan, J.P. Black, Redundancy in Data Structures: Some Theoretical Results IEEE Transactions on Software Engineering. ,vol. 6, pp. 595- 602 ,(1980) , 10.1109/TSE.1980.230803
Vegdahl, A Survey of Proposed Architectures for the Execution of Functional Languages IEEE Transactions on Computers. ,vol. 33, pp. 1050- 1071 ,(1984) , 10.1109/TC.1984.1676387
D.J. Taylor, D.E. Morgan, J.P. Black, Redundancy in Data Structures: Improving Software Fault Tolerance IEEE Transactions on Software Engineering. ,vol. 6, pp. 585- 594 ,(1980) , 10.1109/TSE.1980.234507
Gregory R. Andrews, Fred B. Schneider, Concepts and Notations for Concurrent Programming ACM Computing Surveys. ,vol. 15, pp. 3- 43 ,(1983) , 10.1145/356901.356903