Authors: J.S. Plank, W.R. Elwasif
Keywords: Local area network, Condition monitoring, Rollback recovery, Computer science, System recovery, Experimental research, Running time, Distributed computing, Theoretical research, Workstation
Abstract: In the past twenty years, there has been a wealth of theoretical research on minimizing the expected running time of a program in the presence of failures by employing checkpointing and rollback recovery. In the same period, there has been little experimental research to corroborate these results. We study three separate projects that monitor failure in workstation networks. Our goals are twofold. The first is to see how these results correlate with the theoretical results, and the second is to assess their impact on strategies for long-running computations on workstations and networks of workstations. A significant result of our work is that although the base assumptions of the theoretical research do not hold, many of its results are still applicable.