Prediction-based geometric monitoring over distributed data streams

Authors: Nikos Giatrakos, Antonios Deligiannakis, Minos Garofalakis, Izchak Sharfman, Assaf Schuster

DOI: 10.1145/2213836.2213867

Keywords: Estimator, Ranging, Current (mathematics), Function (mathematics), Computer science, Data mining, Data stream mining, Domain (software engineering), Data set, Transmission (telecommunications)

Abstract: Many modern streaming applications, such as online analysis of financial, network, sensor and other forms of data, are inherently distributed in nature. An important query type that is the focal point in such application scenarios regards actuation queries, where proper action is dictated based on a trigger condition placed upon the current value that a monitored function receives. Recent work studies the problem of (non-linear) sophisticated function tracking in a distributed manner. The main concept behind the geometric monitoring approach proposed there is for each distributed site to perform the function monitoring over an appropriate subset of the input domain. In the current work, we examine whether the distributed monitoring mechanism can become more efficient, in terms of the number of communicated messages, by extending the geometric monitoring framework to utilize prediction models. We initially describe a number of local estimators (predictors) that are useful for the applications that we consider and which have already been shown particularly useful in past work. We then demonstrate the feasibility of incorporating predictors in the geometric monitoring framework and show that prediction-based geometric monitoring in fact generalizes the original geometric monitoring framework. We propose a large variety of different prediction-based monitoring models for the distributed threshold monitoring of complex functions. Our extensive experimentation with a variety of real data sets, functions and parameter settings indicates that our approaches can provide significant communication savings, ranging between two times and up to three orders of magnitude, compared to the transmission cost of the original monitoring framework.
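The local test that the abstract alludes to can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the ball-shaped local constraint commonly used in geometric monitoring, a monitored function f(x) = ||x||^2 (chosen only because its extremes over a ball have a closed form), and a hypothetical linear-growth predictor. All function names are illustrative.

```python
import math


def norm(v):
    """Euclidean norm of a vector given as a list of floats."""
    return math.sqrt(sum(x * x for x in v))


def ball_range_sq_norm(center, radius):
    """Min and max of f(x) = ||x||^2 over the ball (center, radius)."""
    c = norm(center)
    lo = max(0.0, c - radius) ** 2
    hi = (c + radius) ** 2
    return lo, hi


def linear_predictor(sync_vec, velocity, steps):
    """Hypothetical predictor: v_pred(t) = v(sync) + steps * velocity."""
    return [x + steps * d for x, d in zip(sync_vec, velocity)]


def site_must_report(estimate, predicted_local, actual_local, threshold):
    """Return True if the site's ball is not entirely on one side of the threshold.

    estimate        : (predicted) global vector known to every site
    predicted_local : what the predictor says this site's vector is now
    actual_local    : the site's true current vector
    """
    # Drift vector: the estimate shifted by the local deviation from the prediction.
    drift = [e + (a - p) for e, a, p in zip(estimate, actual_local, predicted_local)]
    # Ball whose diameter is the segment from the estimate to the drift vector.
    center = [(e + u) / 2 for e, u in zip(estimate, drift)]
    radius = norm([e - u for e, u in zip(estimate, drift)]) / 2
    lo, hi = ball_range_sq_norm(center, radius)
    all_above = lo > threshold
    all_below = hi <= threshold
    return not (all_above or all_below)


if __name__ == "__main__":
    T = 36.0
    sync, actual = [3.0, 4.0], [6.3, 4.0]
    # Static predictor (prediction = last synchronized vector): original scheme.
    print(site_must_report(sync, sync, actual, T))
    # Linear predictor that tracks the trend well: smaller deviation, smaller ball.
    pred = linear_predictor(sync, [1.0, 0.0], 3)
    print(site_must_report(pred, pred, actual, T))
```

In this single-site toy setting the static predictor reproduces the prediction-free test, which mirrors the abstract's claim that the prediction-based scheme generalizes the original one. An accurate predictor shrinks the ball, so the site stays silent more often.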

References (21)
Abhinandan Das, Sumit Ganguly, Minos Garofalakis, Rajeev Rastogi. Distributed set-expression cardinality estimation. Very Large Data Bases, pp. 312-323 (2004). DOI: 10.1016/B978-012088469-8.50030-9
Graham Cormode, Minos Garofalakis. Approximate continuous querying over distributed streams. ACM Transactions on Database Systems, vol. 33, pp. 1-39 (2008). DOI: 10.1145/1366102.1366106
Qi Zhang, Jinze Liu, Wei Wang. Approximate Clustering on Distributed Data Streams. International Conference on Data Engineering, pp. 1131-1139 (2008). DOI: 10.1109/ICDE.2008.4497522
Graham Cormode, Minos Garofalakis. Streaming in a connected world. Proceedings of the 2007 ACM SIGMOD international conference on Management of data - SIGMOD '07, pp. 1178-1181 (2007). DOI: 10.1145/1247480.1247649
Antonios Deligiannakis, Yannis Kotidis, Nick Roussopoulos. Compressing historical information in sensor networks. International Conference on Management of Data, pp. 527-538 (2004). DOI: 10.1145/1007568.1007628
Ke Yi, Qin Zhang. Optimal tracking of distributed heavy hitters and quantiles. Proceedings of the twenty-eighth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems - PODS '09, pp. 167-174 (2009). DOI: 10.1145/1559795.1559820
Guy Sagy, Daniel Keren, Izchak Sharfman, Assaf Schuster. Distributed threshold querying of general functions by a difference of monotonic representation. Proceedings of the VLDB Endowment, vol. 4, pp. 46-57 (2010). DOI: 10.14778/1921071.1921072
Izchak Sharfman, Assaf Schuster, Daniel Keren. Shape sensitive geometric monitoring. Proceedings of the twenty-seventh ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems - PODS '08, pp. 301-310 (2008). DOI: 10.1145/1376916.1376958
Chris Olston, Jing Jiang, Jennifer Widom. Adaptive filters for continuous queries over distributed data streams. International Conference on Management of Data, pp. 563-574 (2003). DOI: 10.1145/872757.872825
Graham Cormode, S. Muthukrishnan, Wei Zhuang. Conquering the Divide: Continuous Clustering of Distributed Data Streams. International Conference on Data Engineering, pp. 1036-1045 (2007). DOI: 10.1109/ICDE.2007.368962