作者: Olli-Pekka Lehto
DOI: 10.1007/978-3-642-36803-5_22
关键词:
摘要: This paper introduces Numprof, a profiling framework for performance analysis of numerical libraries. The consists profiler and replayer the BLAS FFTW3 records library call events with user configurable amount detail. can be used to execute calls based on trace files generated by profiler. We explore real-world use cases demonstrate that due its low overhead it is feasible continuous statistical calls.