Low-impact profiling of streaming, heterogeneous applications

作者: Joseph Marion Lancaster , Roger Chamberlain

DOI: 10.7936/K7K935KF

关键词:

摘要: Computer engineers are continually faced with the task of translating improvements in fabrication process technology (i.e., Moore's Law) into architectures that allow computer scientists to accelerate application performance. As feature-size continues shrink, architects commodity processors designing increasingly more cores on a chip. While additional can operate independently some tasks (e.g. OS and user tasks), many applications see little no improvement from adding processor alone. For applications, heterogeneous systems offer path toward higher Significant performance power gains have been realized by combining specialized (e.g., Field-Programmable Gate Arrays, Graphics Processing Units) general purpose multi-core processors. Heterogeneous need be programmed differently than traditional software. One approach, stream processing, fits these particularly well because segmented memories explicit expression parallelism. Unfortunately, debugging tools support streaming, do not exist. This dissertation presents TimeTrial, measurement system enables optimization streaming profiling deployed system. TimeTrial performs low-impact measurements dedicating computing resources monitoring aggressively compressing traces statistical summaries guided specification queries interest.

参考文章(80)
Ian A. Troxel, Alan D. George, Ryan A. DeVille, Performance Monitoring for Run-time Management of Reconfigurable Devices. ERSA. pp. 175- 181 ,(2005)
Jeremy D. Buhler, Joseph M. Lancaster, D Roger, Arpith C. Jacob, Mercury BLASTN: Faster DNA Sequence Comparison using a Streaming Hardware Architecture ,(2007)
William Thies, Michal Karczmarek, Saman Amarasinghe, StreamIt: A Language for Streaming Applications compiler construction. pp. 179- 196 ,(2002) , 10.1007/3-540-45937-5_14
Gilles Kahn, The Semantics of a Simple Language for Parallel Programming. ifip congress. pp. 471- 475 ,(1974)
D.R. Horn, M. Houston, P. Hanrahan, ClawHMMER: A Streaming HMMer-Search Implementatio conference on high performance computing (supercomputing). pp. 11- 11 ,(2005) , 10.1109/SC.2005.18
J. Buhler, M. Franklin, P. Krishnamurthy, J. Lancaster, R. Chamberlain, M. Gyang, Biosequence similarity search on the Mercury system application-specific systems, architectures, and processors. pp. 365- 375 ,(2004) , 10.1109/ASAP.2004.12
D. Zaretsky, G. Mittal, Xiaoyong Tang, P. Banerjee, Overview of the FREEDOM compiler for mapping DSP software to FPGAs field-programmable custom computing machines. pp. 37- 46 ,(2004) , 10.1109/FCCM.2004.44
Greg Morrisett, Lewis Girod, Sam Madden, Ryan Newton, Michael Craig, WaveScript: A Case-Study in Applying a Distributed Stream-Processing Language ,(2008)