Clustering on Sliding Windows in Polylogarithmic Space.

作者: Vladimir Braverman , Harry Lang , Keith Levin , Morteza Monemizadeh

DOI: 10.4230/LIPICS.FSTTCS.2015.350

关键词: Cluster analysisMetric (mathematics)Sublinear functionBounded functionSliding window protocolPolynomialSpace (mathematics)Window (computing)CombinatoricsMathematics

摘要: In PODS 2003, Babcock, Datar, Motwani and O'Callaghan gave the first streaming solution for k-median problem on sliding windows using O(frack k tau^4 W^2tau log^2 W) space, with a O(2^O(1/tau)) approximation factor, where W is window size tau in (0,1/2) user-specified parameter. They left as an open question whether it possible to improve this polylogarithmic space. Despite much progress clustering windows, has remained more than decade. In paper, we partially answer main posed by O'Callaghan. We present algorithm yielding exponential improvement space compared previous result given et al. particular, give (alpha,beta)-approximation metric model, alpha beta are constants, under assumption, also made Babcock al., that optimal cost any bounded polynomial size. justify assumption showing when size, no sublinear possible. Our technical contribution simple but elegant extension of smooth functions introduced Braverman Ostrovsky, which allows us apply well-known techniques solving problems model to not smooth, such cost.

参考文章(34)
Pranjal Awasthi, Maria-Florina Balcan, Center Based Clustering: A Foundational Perspective ,(2014)
Michael S. Crouch, Andrew McGregor, Daniel Stubbs, Dynamic Graphs in the Sliding-Window Model european symposium on algorithms. pp. 337- 348 ,(2013) , 10.1007/978-3-642-40450-4_29
Sariel Har-Peled, Soham Mazumdar, Coresets for $k$-Means and $k$-Median Clustering and their Applications. arXiv: Computational Geometry. ,(2018)
Jarosław Byrka, Thomas Pensyl, Khoa Trinh, Bartosz Rybicki, Aravind Srinivasan, An improved approximation for k-median, and positive correlation in budgeted optimization symposium on discrete algorithms. ,vol. 13, pp. 737- 756 ,(2015) , 10.5555/2722129.2722179
Dan Feldman, Amos Fiat, Micha Sharir, Coresets forWeighted Facilities and Their Applications foundations of computer science. pp. 315- 324 ,(2006) , 10.1109/FOCS.2006.22
Jon Louis Bentley, James B Saxe, Decomposable searching problems I. Static-to-dynamic transformation Journal of Algorithms. ,vol. 1, pp. 301- 358 ,(1980) , 10.1016/0196-6774(80)90015-2
Paul Beame, Raphael Clifford, Widad Machmouchi, Element Distinctness, Frequency Moments, and Sliding Windows 2013 IEEE 54th Annual Symposium on Foundations of Computer Science. pp. 290- 299 ,(2013) , 10.1109/FOCS.2013.39
Sariel Har-Peled, Akash Kushal, Smaller coresets for k-median and k-means clustering symposium on computational geometry. pp. 126- 134 ,(2005) , 10.1145/1064092.1064114
Pankaj K. Agarwal, Sariel Har-Peled, Kasturi R. Varadarajan, Approximating extent measures of points Journal of the ACM. ,vol. 51, pp. 606- 635 ,(2004) , 10.1145/1008731.1008736