An efficient query platform for streaming and dynamic natural graphs

Authors: Milindu Sanoj Kumarage, Yasanka Horawalavithana, D.N. Ranasinghe

DOI: 10.1109/ICIINFS.2017.8300418

Keywords:

Abstract: Massive-scale data streaming is now prevalent and can be used to dynamically build large graphs, which can then be analyzed efficiently for insightful information. In situations where real-time analytics is required, approximate outcomes within time bounds may be desirable. We have identified graph summarization, and TCM sketching in particular, as a good technique for such data. A set of metrics, such as Average Relative Error, Number of Effective Queries, Effectiveness of Queries, and Confusion Matrix, is provided for queries on graphs. We propose extensions to the model for automatic sketch creation while the graph is being constructed, and evaluate the approach with different policies and query combinations. The proposed framework works well, with 80% to 90% efficiency and ±3 deviations from exact results.
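The sketching idea named in the abstract can be illustrated with a minimal Python example. It is a sketch under stated assumptions, not the authors' implementation: it assumes that TCM sketching follows the count-min pattern, in which nodes are hashed into a small number of buckets and edge weights are accumulated in several compact adjacency matrices. The class `GraphSketch`, its parameters `width` and `depth`, and the function `average_relative_error` are hypothetical names introduced here, and the error formula is one common definition of Average Relative Error that may differ from the one used in the paper.

```python
import hashlib


class GraphSketch:
    """Hypothetical count-min-style graph sketch (assumed, not from the paper).

    Keeps `depth` independent width x width adjacency matrices. Each matrix
    uses its own hash function to map node labels to bucket indices.
    """

    def __init__(self, width, depth):
        self.width = width
        self.depth = depth
        self.matrices = [
            [[0] * width for _ in range(width)] for _ in range(depth)
        ]

    def _bucket(self, node, i):
        # Salt the node label with the matrix index to get one hash per matrix.
        digest = hashlib.sha256(f"{i}:{node}".encode()).digest()
        return int.from_bytes(digest[:8], "big") % self.width

    def add_edge(self, src, dst, weight=1):
        # Streaming update: touch one cell in every matrix.
        for i, matrix in enumerate(self.matrices):
            matrix[self._bucket(src, i)][self._bucket(dst, i)] += weight

    def edge_weight(self, src, dst):
        # Collisions can only inflate a cell (for non-negative weights),
        # so the minimum over all matrices is the tightest estimate.
        return min(
            matrix[self._bucket(src, i)][self._bucket(dst, i)]
            for i, matrix in enumerate(self.matrices)
        )


def average_relative_error(exact, estimated):
    """Mean of |estimate - exact| / exact over queries with a nonzero exact answer."""
    errors = [
        abs(estimated[q] - exact[q]) / exact[q] for q in exact if exact[q] != 0
    ]
    return sum(errors) / len(errors) if errors else 0.0


# Usage: feed an edge stream, then compare sketch answers with exact counts.
stream = [("a", "b"), ("a", "b"), ("b", "c"), ("c", "a"), ("a", "b")]
sketch = GraphSketch(width=16, depth=3)
exact = {}
for src, dst in stream:
    sketch.add_edge(src, dst)
    exact[(src, dst)] = exact.get((src, dst), 0) + 1

estimated = {edge: sketch.edge_weight(*edge) for edge in exact}
print(average_relative_error(exact, estimated))
```

Memory use is fixed at `depth * width * width` counters regardless of how many nodes arrive, which is what makes this kind of summary attractive for time-bounded approximate queries. The price is that estimates can only overshoot the exact edge weight, never undershoot it, as long as all weights are non-negative.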
