Large Scale Estimation in Cyberphysical Systems using Streaming Data: a Case Study with Smartphone Traces

作者: Alexandre M. Bayen , Pieter Abbeel , Matei Zaharia , Tathagata Das , Timothy Hunter

DOI:

关键词:

摘要: Controlling and analyzing cyberphysical robotics systems is increasingly becoming a Big Data challenge. Pushing this data to, processing in the cloud more efficient than on-board processing. However, current cloud-based solutions are not suitable for latency requirements of these applications. We present new concept, Discretized Streams or D-Streams, that enables massively scalable computations on streaming with latencies as short second. experiment an implementation D-Streams top Spark computing framework. demonstrate usefulness concept novel algorithm to estimate vehicular traffic urban networks. Our online EM can very large city network (the San Francisco Bay Area) by tens thousands observations per second, few seconds. Note Practitioners This work was driven need at scale, setting, using commodity hardware. Machine Learning algorithms combined new, but it still requires deep expertise both Computer Systems achieve scale tractable manner. The Streaming project aims providing interface abstracts out all technical details computation platform (cloud, HPC, workstation, etc.). As shown work, imple- menting calibrating non-trivial cluster, provides intuitive yet powerful programming interface. readers invited refer source code referred article examples. presents sample compute densities Gamma random variables restricted hyperplane (i.e. distributions form Tij P j jTj = d Tj independant distributions). It common case use Gaussian because closed solve. If one considers positive valued heavy tails, our formulas gamma may be suitable.

参考文章(22)
Xuegang(Jeff) Ban, Ryan Herring, J.D. Margulici, Alexandre M. Bayen, Optimal Sensor Placement for Freeway Travel Time Estimation Springer, Boston, MA. pp. 697- 721 ,(2009) , 10.1007/978-1-4419-0820-9_34
Tomio Miwa, Taka Morikawa, Takaaki Sakai, Route Identification and Travel Time Prediction Using Probe-Car Data International Journal of ITS Research, Vol.2, No.1, 2004, p.21-28. ,vol. 2, pp. 21- 28 ,(2004)
Nikolas Geroliminis, Alexander Skabardonis, Real-Time Estimation of Travel Times on Signalized Arterials Transportation and Traffic Theory. Flow, Dynamics and Human Interaction. 16th International Symposium on Transportation and Traffic TheoryUniversity of Maryland, College Park. pp. 387- 406 ,(2005)
Jaliya Ekanayake, Hui Li, Bingjing Zhang, Thilina Gunarathne, Seung-Hee Bae, Judy Qiu, Geoffrey Fox, Twister: a runtime for iterative MapReduce high performance distributed computing. pp. 810- 818 ,(2010) , 10.1145/1851476.1851593
P. G. Moschopoulos, The distribution of the sum of independent gamma random variables Annals of the Institute of Statistical Mathematics. ,vol. 37, pp. 541- 544 ,(1985) , 10.1007/BF02481123
A. Hofleitner, A. Bayen, Optimal decomposition of travel times measured by probe vehicles using a statistical traffic flow model international conference on intelligent transportation systems. pp. 815- 821 ,(2011) , 10.1109/ITSC.2011.6083050
A. P. Dempster, N. M. Laird, D. B. Rubin, Maximum Likelihood from Incomplete Data Via theEMAlgorithm Journal of the Royal Statistical Society: Series B (Methodological). ,vol. 39, pp. 1- 22 ,(1977) , 10.1111/J.2517-6161.1977.TB01600.X
Michael James Lighthill, Gerald Beresford Whitham, On kinematic waves II. A theory of traffic flow on long crowded roads Proceedings of The Royal Society A: Mathematical, Physical and Engineering Sciences. ,vol. 229, pp. 317- 345 ,(1955) , 10.1098/RSPA.1955.0089
D. B. Work, S. Blandin, O. P. Tossavainen, B. Piccoli, A. M. Bayen, A traffic model for velocity data assimilation Applied Mathematics Research Express. ,vol. 2010, pp. 1- 35 ,(2010) , 10.1093/AMRX/ABQ002
Xuegang (Jeff) Ban, Ryan Herring, Peng Hao, Alexandre M. Bayen, Delay Pattern Estimation for Signalized Intersections Using Sampled Travel Times Transportation Research Record. ,vol. 2130, pp. 109- 119 ,(2009) , 10.3141/2130-14