A hardware-driven profiling scheme for identifying program hot spots to support runtime optimization

Authors: Wen-mei W. Hwu, John C. Gyllenhaal, Andrew R. Trick, Matthew C. Merten, Christopher N. George

DOI: 10.1145/300979.300991

Abstract: This paper presents a novel hardware-based approach for identifying, profiling, and monitoring hot spots in order to support runtime optimization of general purpose programs. The proposed approach consists of a set of tightly coupled hardware tables and control logic modules that are placed in the retirement stage of the processor pipeline, removed from the critical path. The features of the proposed design include rapid detection of program hot spots after changes in execution behavior, runtime-tunable selection criteria for hot spot detection, and negligible overhead during application execution. Experiments using several SPEC95 benchmarks, as well as several large WindowsNT applications, demonstrate the promise of the proposed design.
