Bridging the GAP: towards approximate graph analytics

作者: Anand Padmanabha Iyer , Aurojit Panda , Shivaram Venkataraman , Mosharaf Chowdhury , Aditya Akella

DOI: 10.1145/3210259.3210269

关键词: Graph analyticsGraphGraph (abstract data type)AnalyticsGraph propertyComputer scienceBig dataTheoretical computer science

摘要: While there has been a tremendous interest in processing data that an underlying graph structure, existing distributed systems take several minutes or even hours to execute popular algorithms. However, cases, providing approximate answer is good enough. Approximate analytics seeing considerable attention big due its ability produce timely results by trading accuracy, but they do not support analytics. In this paper, we bridge gap and first attempt at realizing We discuss how traditional techniques carry over the usecase. Leveraging characteristics of properties algorithms, propose sparsification technique, machine learning based approach choose apt amount required meet given budget. Our preliminary evaluations show encouraging results.

参考文章(25)
Joseph E Gonzalez, Yucheng Low, Haijie Gu, Danny Bickson, Carlos Guestrin, None, PowerGraph: distributed graph-parallel computation on natural graphs operating systems design and implementation. pp. 17- 30 ,(2012) , 10.5555/2387880.2387883
Johannes Gehrke, Wenlei Xie, Guozhang Wang, Alan J. Demers, Asynchronous Large-Scale Graph Processing Made Easy conference on innovative data systems research. ,(2013)
Reynold S. Xin, Ion Stoica, Joseph E. Gonzalez, Daniel Crankshaw, Ankur Dave, Michael J. Franklin, GraphX: graph processing in a distributed dataflow framework operating systems design and implementation. pp. 599- 613 ,(2014) , 10.5555/2685048.2685096
Adam Wierman, Minlan Yu, Ganesh Ananthanarayanan, Ion Stoica, Michael Chien-Chun Hung, Xiaoqi Ren, GRASS: trimming stragglers in approximation analytics networked systems design and implementation. pp. 289- 302 ,(2014) , 10.5555/2616448.2616475
Guy Blelloch, Aapo Kyrola, Carlos Guestrin, GraphChi: large-scale graph computation on just a PC operating systems design and implementation. ,vol. 2012, pp. 31- 46 ,(2012) , 10.5555/2387880.2387884
Danny Bickson, Aapo Kyrola, Carlos Guestrin, Joseph Hellerstein, Yucheng Low, Joseph Gonzalez, GraphLab: a new framework for parallel machine learning uncertainty in artificial intelligence. pp. 340- 349 ,(2010)
Peter Macko, Virendra J. Marathe, Daniel W. Margo, Margo I. Seltzer, LLAMA: Efficient graph analytics using Large Multiversioned Arrays international conference on data engineering. pp. 363- 374 ,(2015) , 10.1109/ICDE.2015.7113298
Scott Beamer, Krste Asanovic, David Patterson, Locality Exists in Graph Processing: Workload Characterization on an Ivy Bridge Server ieee international symposium on workload characterization. pp. 56- 65 ,(2015) , 10.1109/IISWC.2015.12
Anand Iyer, Li Erran Li, Ion Stoica, None, CellIQ: real-time cellular network analytics at scale networked systems design and implementation. pp. 309- 322 ,(2015)
P. Boldi, S. Vigna, The webgraph framework I Proceedings of the 13th conference on World Wide Web - WWW '04. pp. 595- 602 ,(2004) , 10.1145/988672.988752