Online event clustering in temporal dimension

作者: Hoang Thanh Lam , Eric Bouillet

DOI: 10.1145/2666310.2666393

关键词:

摘要: This work is motivated by a real-life application that exploits sensor data available from traffic light control systems currently deployed in many cities around the world. Each consists of an induction loop generates stream events triggered whenever metallic object e.g. car, bus, or bicycle, detected above sensor. Because red phase lights objects are usually divided into groups move together. Detecting these as long they pass through useful for estimating status toad networks such car queue length detecting anomalies. In this work, given contains observations event, detection moving object, together with timestamps indicating when happen, we study problem clusters real-time based on proximity event's occurrence time. We propose efficient algorithm scales up to large streams extracted thousands sensors city London. Moreover, our better than baseline algorithms terms clustering accuracy. demonstrate motivations showing use-case which results used lengths road and

参考文章(13)
Jean Bacon, Andrei Iu. Bejan, Alastair R. Beresford, David Evans, Richard J. Gibbens, Ken Moody, Using real-time road traffic data to evaluate congestion Dependable and Historic Computing. pp. 93- 117 ,(2011) , 10.1007/978-3-642-24541-1_9
L. O'Callaghan, N. Mishra, A. Meyerson, S. Guha, R. Motwani, Streaming-data algorithms for high-quality clustering international conference on data engineering. pp. 685- 694 ,(2002) , 10.1109/ICDE.2002.994785
Albert Bifet, Geoff Holmes, Bernhard Pfahringer, Jesse Read, Philipp Kranen, Hardy Kremer, Timm Jansen, Thomas Seidl, MOA: A Real-Time Analytics Open Source Framework Machine Learning and Knowledge Discovery in Databases. pp. 617- 620 ,(2011) , 10.1007/978-3-642-23808-6_41
Jürgen Beringer, Eyke Hüllermeier, Online clustering of parallel data streams data and knowledge engineering. ,vol. 58, pp. 180- 204 ,(2006) , 10.1016/J.DATAK.2005.05.009
Philipp Kranen, Ira Assent, Corinna Baldauf, Thomas Seidl, The ClusTree: indexing micro-clusters for anytime stream mining Knowledge and Information Systems. ,vol. 29, pp. 249- 272 ,(2011) , 10.1007/S10115-010-0342-8
Xuegang (Jeff) Ban, Peng Hao, Zhanbo Sun, Real time queue length estimation for signalized intersections using travel times from mobile sensors Transportation Research Part C-emerging Technologies. ,vol. 19, pp. 1133- 1156 ,(2011) , 10.1016/J.TRC.2011.01.002
Ryan Herring, Aude Hofleitner, Pieter Abbeel, Alexandre Bayen, Estimating arterial traffic conditions using sparse probe data international conference on intelligent transportation systems. pp. 929- 936 ,(2010) , 10.1109/ITSC.2010.5624994
Jonathan A. Silva, Elaine R. Faria, Rodrigo C. Barros, Eduardo R. Hruschka, André C. P. L. F. de Carvalho, João Gama, Data stream clustering ACM Computing Surveys. ,vol. 46, pp. 1- 31 ,(2013) , 10.1145/2522968.2522981
Tian Zhang, Raghu Ramakrishnan, Miron Livny, BIRCH: an efficient data clustering method for very large databases international conference on management of data. ,vol. 25, pp. 103- 114 ,(1996) , 10.1145/233269.233324
S. Guha, A. Meyerson, N. Mishra, R. Motwani, L. O'Callaghan, Clustering data streams: Theory and practice IEEE Transactions on Knowledge and Data Engineering. ,vol. 15, pp. 515- 528 ,(2003) , 10.1109/TKDE.2003.1198387