A Distributed Systems Architecture Supporting High Availability and Reliability

作者: Paul D. Ezhilchelvan , Santosh K. Shrivastava

DOI: 10.1007/978-3-7091-9198-9_4

关键词:

摘要: A reliable distributed systems architecture composed of fail-silent nodes connected by redundant networks is developed. node constructed replicating the computations on two distinct and dedicated processors which check each other’s performance to form a self-checking processor pair. Given that no more than one in fails, guaranteed either function correctly or effectively stop functioning. High availability system services can be obtained application level processes nodes. Managing general, replicated particular, require group communication (multicast communication) services. The paper presents an integrated suit protocols for performing multicasts, illustrates how dual redundancy exploited implementing these protocols.

参考文章(20)
PA Barret, Andrew M Hilborne, Peter G Bond, Douglas T Seaton, Paulo Veríssimo, Luís Rodrigues, Neil A Speirs, None, The Delta-4 extra performance architecture (XPA) [1990] Digest of Papers. Fault-Tolerant Computing: 20th International Symposium. pp. 481- 488 ,(1990) , 10.1109/FTCS.1990.89386
Jo-Mei Chang, N. F. Maxemchuk, Reliable broadcast protocols ACM Transactions on Computer Systems. ,vol. 2, pp. 251- 273 ,(1984) , 10.1145/989.357400
Leslie Lamport, , Time, clocks, and the ordering of events in a distributed system Concurrency and Computation: Practice and Experience. pp. 179- 196 ,(2019) , 10.1145/3335772.3335934
R. L. Rivest, A. Shamir, L. Adleman, A method for obtaining digital signatures and public-key cryptosystems Communications of the ACM. ,vol. 26, pp. 96- 99 ,(1983) , 10.1145/357980.358017
Joseph Y. Halpern, Barbara Simons, Ray Strong, Danny Dolev, Fault-tolerant clock synchronization principles of distributed computing. pp. 89- 102 ,(1984) , 10.1145/800222.806739
Larry L. Peterson, Nick C. Buchholz, Richard D. Schlichting, Preserving and using context information in interprocess communication ACM Transactions on Computer Systems. ,vol. 7, pp. 217- 246 ,(1989) , 10.1145/65000.65001
M.C. Little, S.K. Shrivastava, Replicated K-resilient objects in Arjuna workshop on management of replicated data. pp. 53- 58 ,(1990) , 10.1109/MRD.1990.138245
J.-C. Laprie, J. Arlat, C. Beounes, K. Kanoun, Definition and analysis of hardware- and software-fault-tolerant architectures IEEE Computer. ,vol. 23, pp. 39- 51 ,(1990) , 10.1109/2.56851
Kenneth P. Birman, Thomas A. Joseph, Reliable communication in the presence of failures ACM Transactions on Computer Systems. ,vol. 5, pp. 47- 76 ,(1987) , 10.1145/7351.7478