Reevaluating Data Stall Time with the Consideration of Data Access Concurrency

作者: Yu-Hang Liu , Xian-He Sun

DOI: 10.1007/S11390-015-1517-2

关键词:

摘要: Data access delay has become the prominent performance bottleneck of high-end computing systems. The key to reducing data in system design is diminish stall time. Memory locality and concurrency are two essential factors influencing modern memory However, existing studies time rarely focus on utilizing because impact overall not well understood. In this study, a pair novel models, L-C model for combined effort P-M effect pure miss time, presented. models provide new understanding directions optimization. Based these summary table advanced cache optimizations It 38 entries contributed by while only 21 locality, which shows value concurrency. their associated results opportunities introduced study important necessary future data-centric architecture algorithm

参考文章(21)
John L. Hennessy, David A. Patterson, Computer Architecture: A Quantitative Approach ,(1989)
Miquel Moreto, Francisco J. Cazorla, Alex Ramirez, Mateo Valero, MLP-aware dynamic cache partitioning high performance embedded architectures and compilers. pp. 337- 352 ,(2008) , 10.1007/978-3-540-77560-7_23
Alain Ketterlin, Philippe Clauss, Profiling Data-Dependence to Assist Parallelization: Framework, Scope, and Optimization international symposium on microarchitecture. pp. 437- 448 ,(2012) , 10.1109/MICRO.2012.47
Roman Iakymchuk, Paolo Bientinesi, Modeling performance through memory-stalls ACM SIGMETRICS Performance Evaluation Review. ,vol. 40, pp. 86- 91 ,(2012) , 10.1145/2381056.2381076
Cloyce D. Spradling, SPEC CPU2006 benchmark tools ACM SIGARCH Computer Architecture News. ,vol. 35, pp. 130- 134 ,(2007) , 10.1145/1241601.1241625
Wm. A. Wulf, Sally A. McKee, Hitting the memory wall ACM SIGARCH Computer Architecture News. ,vol. 23, pp. 20- 24 ,(1995) , 10.1145/216585.216588
Xian-He Sun, Dawei Wang, APC ACM SIGMETRICS Performance Evaluation Review. ,vol. 40, pp. 125- 130 ,(2012) , 10.1145/2381056.2381082
Dawei Wang, Xian-He Sun, APC: A Novel Memory Metric and Measurement Methodology for Modern Memory Systems IEEE Transactions on Computers. ,vol. 63, pp. 1626- 1639 ,(2014) , 10.1109/TC.2013.38
Yue Wu, Yun-Ji Chen, Tian-Shi Chen, Qi Guo, Lei Zhang, An Elastic Architecture Adaptable to Various Application Scenarios Journal of Computer Science and Technology. ,vol. 29, pp. 227- 238 ,(2014) , 10.1007/S11390-014-1425-X
Shekhar Borkar, Andrew A. Chien, The future of microprocessors Communications of The ACM. ,vol. 54, pp. 67- 77 ,(2011) , 10.1145/1941487.1941507