Authors: Brent N. Chun, David E. Culler
DOI: 10.1007/10720115_1
Keywords:
Abstract: Bringing clusters of computers into the mainstream as general-purpose computing systems requires that better facilities for transparent remote execution of parallel and sequential applications be developed. While much research has been done in this area, most of this work remains inaccessible for clusters built using contemporary hardware and operating systems. Implementations are either too old and/or not publicly available, require the use of operating systems which are not supported by modern hardware, or simply do not meet the functional requirements demanded by practical real-world settings. To address these issues, we designed REXEC, a decentralized, secure remote execution facility. It provides high availability, scalability, transparent remote execution, dynamic cluster configuration, decoupled node discovery and selection, a well-defined failure and cleanup model, parallel and distributed program support, and strong authentication and encryption. The system is implemented and currently installed on a 32-node cluster of 2-way SMPs running the Linux 2.2.5 operating system.