GLUnix: a global layer Unix for a network of workstations

作者: Douglas P. Ghormley , David Petrou , Steven H. Rodrigues , Amin M. Vahdat , Thomas E. Anderson

DOI: 10.1002/(SICI)1097-024X(19980725)28:9<929::AID-SPE183>3.0.CO;2-C

关键词:

摘要: Recent improvements in network and workstation performance have made clusters an attractive architecture for diverse workloads, including interactive sequential parallel applications. Although viable hardware solutions are available today, the largest challenge making such a cluster usable lies system software. This paper describes design implementation of GLUnix, operating middleware workstations. GLUnix was designed to provide transparent remote execution, support jobs, load ballancing, backward compatibility existing application binaries. constructed be easily portable number platforms. has been daily use over two half years is currently running on 100-node Sun UltraSPARCs. relates our experiences with designing, building, GLUnix. We discuss three important tradeoffs faced by any system, present reasons choices. Each these decisions then re-evaluated light both experience recent technological advancements. describe user-level, centralized, event-driven highlight aspects implementation. Performance scalability measurements indicate that user-level can scale gracefully significant sizes, incurring only additional 220 μs overhead per node execution. The discussion focuses successes failures we encountered while building maintaining characterization limitations various features were added satisfy user community. © 1998 John Wiley & Sons, Ltd.

参考文章(37)
Neil Webber, Operating System Support for Portable Filesystem Extensions. USENIX Winter. pp. 219- 228 ,(1993)
James S. Plank, Kai Li, Micah Beck, Gerry Kingsley, Libckpt: transparent checkpointing under Unix usenix annual technical conference. pp. 18- 18 ,(1995)
John K. Ousterhout, Scheduling Techniques for Concurrent Systems. international conference on distributed computing systems. pp. 22- 30 ,(1982)
R. Srinivasan, XDR: External Data Representation Standard RFC. ,vol. 1832, pp. 1- 24 ,(1995)
Vlada Matena, Moti Thadani, Ken Shirriff, Yousef A. Khalidi, Jose M. Bernabeu, Solaris MC: A Multi-Computer OS usenix annual technical conference. pp. 16- 16 ,(1996)
Nicholas Carriero, David Gelernter, Linda in context Communications of the ACM. ,vol. 32, pp. 444- 458 ,(1989) , 10.1145/63334.63337
Thomas E. Anderson, Brian N. Bershad, Edward D. Lazowska, Henry M. Levy, Scheduler activations ACM Transactions on Computer Systems. ,vol. 10, pp. 53- 79 ,(1992) , 10.1145/146941.146944
Andrea C. Dusseau, Remzi H. Arpaci, David E. Culler, Effective distributed scheduling of parallel workloads measurement and modeling of computer systems. ,vol. 24, pp. 25- 36 ,(1996) , 10.1145/233008.233020
Remzi H. Arpaci, David E. Culler, Arvind Krishnamurthy, Steve G. Steinberg, Katherine Yelick, Empirical evaluation of the CRAY-T3D: a compiler perspective international symposium on computer architecture. ,vol. 23, pp. 320- 331 ,(1995) , 10.1145/223982.224443
J.P. Verboncoeur, A.B. Langdon, N.T. Gladd, An object-oriented electromagnetic PIC code Computer Physics Communications. ,vol. 87, pp. 199- 211 ,(1995) , 10.1016/0010-4655(94)00173-Y