Load Shedding in XML Streams

作者: Murali Mani , Mingzhu Wei , Elke A. Rundensteiner

DOI:

关键词:

摘要: Because of the high volume and unpredictability arrival data streams, stream processing systems may not always be able to keep up with input — resulting in buffer overflow uncontrolled loss data. Load shedding, prevalent strategy for solving this problem, has todate been considered relational engines. On other hand face additional challenges opportunities ”structural shedding”, due complex nested XML result structures. We now tackle open shedding problem by a three-pronged solution. First, we develop preference model XQuery enable users specify relative importance preserving different subpattern structure. This transforms into rewriting user query possibly several queries that return approximate answers yet highest possible utility as measured given model. Two, cost compare both performance alternate queries. Third,we propose two solutions: OptShed, FastShed. OptShed guarantees find an optimal solution however at exponential complexity. FashShed confirmed our experiments, efficiently achieves close-to-optimal wide range cases. Lastly describe in-automaton mechanism Raindrop system. The experimental results show proposed preference-driven solutions consistently achieve higher compared existing “relational” techniques.

参考文章(25)
Bernd Hafenrichter, Werner Kießling, Optimizing Preference Queries for Personalized Web Services. communications, internet, and information technology. pp. 461- 466 ,(2002)
Denilson Barbosa, Kelly A. Lyons, John Keenleyside, Alberto O. Mendelzon, ToXgene: An extensible template-based data generator for XML. international workshop on the web and databases. pp. 49- 54 ,(2002)
Rajeev Motwani, Mayur Datar, Brian Babcock, Load Shedding Techniques for Data Stream Systems ,(2003)
Nesime Tatbul, Michael Stonebraker, Mitch Cherniack, Stan Zdonik, Uğur Çetintemel, Load Shedding on Data Streams ,(2003)
Bertram Ludäscher, Pratik Mukhopadhyay, Yannis Papakonstantinou, A transducer-based XML query processor very large data bases. pp. 227- 238 ,(2002) , 10.1016/B978-155860869-6/50028-7
Werner Kießling, Gerhard Köstler, Preference SQL: design, implementation, experiences very large data bases. pp. 990- 1001 ,(2002) , 10.1016/B978-155860869-6/50098-6
Utkarsh Srivastava, Jennifer Widom, Memory-limited execution of windowed stream joins very large data bases. pp. 324- 335 ,(2004) , 10.1016/B978-012088469-8.50031-0
Peter C. Fishburn, Utility theory for decision making ,(1970)
Jennifer Widom, Gurmeet Singh Manku, Chris Olston, Rajeev Motwani, Mayur Datar, Brian Babcock, Justin Rosenstein, Shivnath Babu, Arvind Arasu, Rohit Varma, Query Processing, Approximation, and Resource Management in a Data Stream Management System. conference on innovative data systems research. ,(2003)
Silvano Martello, Paolo Toth, Algorithms for Knapsack Problems North-holland Mathematics Studies. ,vol. 132, pp. 213- 257 ,(1987) , 10.1016/S0304-0208(08)73237-7