Authors: Guanpeng Li, Qining Lu, Karthik Pattabiraman
DOI: 10.1109/DSN.2015.36
Keywords: Long latency, Program code, Real-time computing, Static analysis, Fault injection, Software fault tolerance, Latency (engineering), Computer science, Crash, Software
Abstract: As the rate of transient hardware faults increases, researchers have investigated software techniques to tolerate these faults. An important class of faults are those that cause long-latency crashes (LLCs), that is, faults that can persist for a long time in the program before causing it to crash. In this paper, we develop a technique to automatically find the program locations where LLC-causing faults originate, so that these locations can be protected to bound the program's crash latency. We first identify the program code patterns that are responsible for the majority of LLC-causing faults through an empirical study. We then build CRASHFINDER, a tool that finds LLC locations by statically searching the program for these patterns, and then refining the static analysis results with a dynamic analysis and selective fault injection-based approach. We find that CRASHFINDER can achieve an average of 9.29 orders of magnitude reduction in the time needed to identify more than 90% of the LLC-causing locations in the program, compared to exhaustive fault injection techniques, and has no false positives.