STABILIZER: Statistically Sound Performance Evaluation

DOI:

关键词:

摘要: Researchers and software developers require effective performance evaluation. Researchers must evaluate optimizations or measure overhead. Software developers use automatic performance regression tests to discover when changes improve or degrade performance. The standard methodology is to compare execution times before and after applying changes. Unfortunately, modern architectural features make this approach unsound. Statistically sound evaluation requires multiple samples to test whether one can or cannot (with high confidence) reject the null hypothesis that results are the same before and after. However, caches and branch predictors make performance dependent on machine-specific parameters and the exact layout of code, stack frames, and heap objects. A single binary constitutes just one sample from the space of program layouts, regardless of the number of runs. Since compiler …

acm.org 本地加速

psu.edu PDF 下载加速

参考文章(0)

STABILIZER: Statistically Sound Performance Evaluation

来源期刊

我的账户

STABILIZER: Statistically Sound Performance Evaluation

来源期刊

相似文章 0

我的账户