REXEC: A Decentralized, Secure Remote Execution Environment for Clusters

作者: Brent N. Chun , David E. Culler

DOI: 10.1007/10720115_1

关键词:

摘要: Bringing clusters of computers into the mainstream as general-purpose computing systems requires that better facilities for transparent remote execution parallel and sequential applications be developed. While much research has been done in this area, most work remains inaccessible built using contemporary hardware operating systems. Implementations are either too old and/or not publicly available, require use which supported by modern hardware, or simply do meet functional requirements demanded practical real world settings. To address these issues, we designed REXEC, a decentralized, secure facility. It provides high availability, scalability, execution, dynamic cluster configuration, decoupled node discovery selection, well-defined failure cleanup model, distributed program support, strong authentication encryption. The system is implemented currently installed on 32-node 2-way SMPs running Linux 2.2.5 system.

参考文章(20)
Jim Basney, Miron Livny, Todd Tannenbaum, Michael Litzkow, Checkpoint and Migration of UNIX Processes in the Condor Distributed Processing System University of Wisconsin-Madison Department of Computer Sciences. ,(1997)
Carl A Waldspurger, William E Weihl, Stride Scheduling: Deterministic Proportional- Share Resource Management Massachusetts Institute of Technology. ,(1995)
James S. Plank, Kai Li, Micah Beck, Gerry Kingsley, Libckpt: transparent checkpointing under Unix usenix annual technical conference. pp. 18- 18 ,(1995)
Ken Shirriff, Building distributed process management on an object-oriented framework usenix annual technical conference. pp. 9- 9 ,(1997)
Brent Nee Chun, David E Culler, Market-based Proportional Resource Sharing for Clusters University of California at Berkeley. ,(2000)
Vlada Matena, Moti Thadani, Ken Shirriff, Yousef A. Khalidi, Jose M. Bernabeu, Solaris MC: A Multi-Computer OS usenix annual technical conference. pp. 16- 16 ,(1996)
R. Agrawal, A.K. Ezzat, Location Independent Remote Execution in NEST IEEE Transactions on Software Engineering. ,vol. 13, pp. 905- 912 ,(1987) , 10.1109/TSE.1987.233509
Antônio Marinho Pilla Barcellos, Valdir Rossi Belmonte Filho, Joao Frederico Lacava Schramm, Claudio Fernando Resin Geyer, The HetNOS network operating system: a tool for writing distributed applications Operating Systems Review. ,vol. 28, pp. 34- 47 ,(1994) , 10.1145/191525.191534
John K. Ousterhout, Andrew R. Cherenson, Fred Douglis, Michael N. Nelson, Brent B. Welch, None, The Sprite network operating system IEEE Computer. ,vol. 21, pp. 23- 36 ,(1988) , 10.1109/2.16
Jiubin Ju, Gaochao Xu, Jie Tao, Parallel computing using idle workstations ACM SIGOPS Operating Systems Review. ,vol. 27, pp. 87- 96 ,(1993) , 10.1145/155870.155883