Processing Exact Results for Windowed Stream Joins in a Memory-Limited System: A Disk-Based, Adaptive Approach

作者: Abhirup Chakraborty , Ajit Singh

DOI: 10.1007/978-3-642-35332-1_2

关键词:

摘要: We consider the problem of processing exact results for sliding window joins over data streams with limited memory. Existing approaches either, (1) deal memory limitations by shedding loads, and therefore cannot provide or even highly accurate showing time-varying rate arrivals, (2) suffer from large I/O overhead due to random disk flushes disk-to-disk stages a stream join, making inefficient handle joins. an Adaptive, Hash-partitioned Exact Window Join (AH-EWJ) algorithm incorporating storage as archive. Our spills onto on periodic basis, refines output result properly retrieving disk-resident data, maximizes employing techniques manage blocks, continuously adjusting allocated within windows. The managing blocks in memory—similar nature caching issue—captures both temporal frequency related properties arrivals. present baseline called Rate-based Progressive Joins (RPWJ), which extends existing tune performance reducing while experimental demonstrating effectiveness proposed algorithm.

参考文章(29)
Sunita Sarawagi, Vinayak R. Borkar, Kaustubh Deshmukh, Automatically Extracting Structure from Free Text Addresses. IEEE Data(base) Engineering Bulletin. ,vol. 23, pp. 27- 32 ,(2000)
Nesime Tatbul, Uğur Çetintemel, Stan Zdonik, Mitch Cherniack, Michael Stonebraker, Load shedding in a data stream manager very large data bases. pp. 309- 320 ,(2003) , 10.1016/B978-012722442-8/50035-5
Utkarsh Srivastava, Jennifer Widom, Memory-limited execution of windowed stream joins very large data bases. pp. 324- 335 ,(2004) , 10.1016/B978-012088469-8.50031-0
Abhirup Chakraborty, Ajit Singh, A disk-based, adaptive approach to memory-limited computation of windowed stream joins database and expert systems applications. pp. 251- 260 ,(2010) , 10.1007/978-3-642-15364-8_19
M.F. Mokbel, M. Lu, W.G. Aref, Hash-merge join: a non-blocking join algorithm for producing fast and early join results international conference on data engineering. pp. 251- 262 ,(2004) , 10.1109/ICDE.2004.1320002
Amol Deshpande, Joseph M. Hellerstein, Vijayshankar Raman, Samuel Madden, Mehul A. Shah, Sirish Chandrasekaran, Michael J. Franklin, Kris Hildrum, Adaptive Query Processing: Technology in Evolution. IEEE Data(base) Engineering Bulletin. ,vol. 23, pp. 7- 18 ,(2000)
Jens-Peter Dittrich, Bernhard Seeger, David Scot Taylor, Peter Widmayer, Progressive merge join: a generic and non-blocking sort-based join algorithm very large data bases. pp. 299- 310 ,(2002) , 10.1016/B978-155860869-6/50034-2
Adegoke Ojewole, Qiang Zhu, Wen-Chi Hou, Window join approximation over data streams with importance semantics Proceedings of the 15th ACM international conference on Information and knowledge management - CIKM '06. pp. 112- 121 ,(2006) , 10.1145/1183614.1183635
Rajeev Motwani, Dilys Thomas, Caching queues in memory buffers symposium on discrete algorithms. pp. 541- 549 ,(2004) , 10.5555/982792.982872
Sirish Chandrasekaran, Mehul A. Shah, Owen Cooper, Amol Deshpande, Michael J. Franklin, Joseph M. Hellerstein, Wei Hong, Sailesh Krishnamurthy, Samuel R. Madden, Fred Reiss, TelegraphCQ: continuous dataflow processing international conference on management of data. pp. 668- 668 ,(2003) , 10.1145/872757.872857