ASPIRE: exploiting asynchronous parallelism in iterative algorithms using a relaxed consistency based DSM

作者: Keval Vora , Sai Charan Koduru , Rajiv Gupta

DOI: 10.1145/2660193.2660227

关键词:

摘要: Many vertex-centric graph algorithms can be expressed using asynchronous parallelism by relaxing certain read-after-write data dependences and allowing threads to compute vertex values stale (i.e., not the most recent) of their neighboring vertices. We observe that on distributed shared memory systems, converting synchronous into counterparts, made tolerant high inter-node communication latency. However, latency lead excessive use causing an increase in number iterations required converge. Although bounded staleness we restrict slowdown rate convergence, this also restricts ability tolerate In paper design a relaxed consistency model protocol simultaneously minimize values. This is achieved via coordinated best effort refresh policy staleness. demonstrate for range PDE solvers, average, our approach outperforms based upon: prior models allow at least 2.27x; Bulk Synchronous Parallel (BSP) 4.2x. show frequently GraphLab, popular processing framework.

参考文章(66)
Joseph E Gonzalez, Yucheng Low, Haijie Gu, Danny Bickson, Carlos Guestrin, None, PowerGraph: distributed graph-parallel computation on natural graphs operating systems design and implementation. pp. 17- 30 ,(2012) , 10.5555/2387880.2387883
Johannes Gehrke, Wenlei Xie, Guozhang Wang, Alan J. Demers, Asynchronous Large-Scale Graph Processing Made Easy conference on innovative data systems research. ,(2013)
Vadim Iosevich, Assaf Schuster, Distributed Shared Memory: To Relax or Not to Relax? Lecture Notes in Computer Science. pp. 198- 205 ,(2004) , 10.1007/978-3-540-27866-5_25
Zhiyi Huang, Martin Purvis, Byung-Hyun Yu, Stephen Cranefield, Homeless and home-based Lazy Release Consistency protocols on Distributed Shared Memory ACSC '04 Proceedings of the 27th Australasian conference on Computer science - Volume 26. pp. 117- 123 ,(2004)
Abdelsalam Heddaya, Himanshu Sinha, An Implementation of Mermera: A Shared Memory System that Mixes Coherence with Non-coherence Boston University. ,(1993)
Sandhya Dwarkadas, Alan L. Cox, Willy Zwaenepoel, Pete Keleher, TreadMarks: distributed shared memory on standard workstations and operating systems usenix winter technical conference. pp. 10- 10 ,(1994)
Xiaojin ZhuЃ, Zoubin GhahramaniЃн, None, Learning from labeled and unlabeled data with label propagation Center for Automated Learning and Discovery, CMU: Carnegie Mellon University, USA.. ,(2002)
Gregory R. Ganger, Eric Xing, Kimberly Keeton, Seunghak Lee, James Cipar, Qirong Ho, Garth Gibson, Jin Kyu Kim, Solving the straggler problem with bounded staleness hot topics in operating systems. pp. 22- 22 ,(2013)
Russell Meyers, Zhiyuan Li, ASYNC Loop Constructs for Relaxed Synchronization languages and compilers for parallel computing. pp. 292- 303 ,(2008) , 10.1007/978-3-540-89740-8_20