Anytime Algorithms for GPU Architectures

作者: Rahul Mangharam , Aminreza Abrahimi Saba

DOI: 10.1109/RTSS.2011.41

关键词: CUDAFlow networkInstrumentation (computer programming)Graphics processing unitAlgorithmTraffic congestionParallel algorithmComputer scienceAlgorithm designDistributed computingTardiness

摘要: Most algorithms are run-to-completion and provide one answer upon completion no if interrupted before completion. On the other hand, anytime have a monotonic increasing utility with length of execution time. Our investigation focuses on development time-bounded Graphics Processing Units (GPUs) to trade-off quality output Given time-varying workload, algorithm continually measures its progress remaining contract time decide pathway select system resources required maximize result. To exploit quality-time tradeoff, focus is construction, instrumentation, on-line measurement decision making capable efficiently managing GPU resources. We demonstrate this Parallel A* routing CUDA-enabled GPU. The resource usage described in terms CUDA kernels constructed at design-time. At runtime, selects subset composes them for feedback-control between GPU-CPU achieve controllable computation tardiness by throttling request admissions processing precision. As case study, we implemented AutoMatrix, GPU-based vehicle traffic simulator real-time congestion management which scales up 16 million vehicles US street map. This an early effort enable imprecise approximate parallel architectures stream-based timebounded applications such as prediction route allocation large transportation networks.

参考文章(23)
Adolf D. May, Traffic flow fundamentals ,(1989)
Joseph Y-T. Leung, A Survey of Scheduling Results for Imprecise Computation Tasks Springer, Boston, MA. pp. 35- 42 ,(1995) , 10.1007/978-0-585-26870-5_3
Pedro J. Martín, Roberto Torres, Antonio Gavilanes, CUDA Solutions for the SSSP Problem international conference on computational science. pp. 904- 913 ,(2009) , 10.1007/978-3-642-01970-8_91
Sasa Misailovic, Martin Rinard, Stelios Sidiroglou, Henry Hoffmann, Anant Agarwal, Using Code Perforation to Improve Performance, Reduce Energy Consumption, and Respond to Failures ,(2009)
Pawan Harish, P. J. Narayanan, Accelerating Large Graph Algorithms on the GPU Using CUDA High Performance Computing – HiPC 2007. pp. 197- 208 ,(2007) , 10.1007/978-3-540-77220-0_21
Michael P. Wellman, Chao-Lin Liu, State-Space Abstraction for Anytime Evaluation of Probabilistic Networks Uncertainty Proceedings 1994. pp. 567- 574 ,(1994) , 10.1016/B978-1-55860-332-5.50077-8
Shlomo Zilberstein, Stuart J. Russell, Anytime Sensing, Planning and Action: A Practical Model for Robot Control international joint conference on artificial intelligence. pp. 1402- 1407 ,(1993)
Thomas Dean, Mark Boddy, Solving time-dependent planning problems international joint conference on artificial intelligence. pp. 979- 984 ,(1989)
Eric J. Horvitz, Gregory F. Cooper, Jaap Suermondt, Bounded Conditioning: Flexible Inference for Decisions under Scarce Resources arXiv: Artificial Intelligence. pp. 182- 193 ,(2013)
Jason Ansel, Cy Chan, Yee Lok Wong, Marek Olszewski, Qin Zhao, Alan Edelman, Saman Amarasinghe, PetaBricks Proceedings of the 2009 ACM SIGPLAN conference on Programming language design and implementation - PLDI '09. ,vol. 44, pp. 38- 49 ,(2009) , 10.1145/1542476.1542481