Efficient Probabilistic Reverse k-Nearest Neighbors Query Processing on Uncertain Data

作者: Jiajia Li , Botao Wang , Guoren Wang

DOI: 10.1007/978-3-642-37487-6_34

关键词: Upper and lower boundsQuery optimizationComputer scienceData miningUncertain dataPruning (decision trees)Probabilistic logicProbabilistic databaseScalabilityk-nearest neighbors algorithm

摘要: A reverse k-nearest neighbors (RkNN) query returns all the objects that take object q as their k nearest neighbors. However, data are often uncertain in numerous applications. In this paper, we focus on problem of processing RkNN data. probabilistic (PRkNN) retrieves have higher probabilities than a user-specified threshold to be q. The previous work for answering PRNN mainly based distance relationship between objects, and inapplicable PRkNN when > 1. design novel algorithm support arbitrary values basis two pruning strategies, namely spatial pruning. rule is defined both distances angle ranges objects. And an efficient upper bound probability estimated by algorithm. Extensive experiments conducted study performance proposed approach. results show our has better scalability existing solution regarding growth k.

参考文章(23)
Ioana Stanoi, Amr El Abbadi, Divyakant Agrawal, Reverse Nearest Neighbor Queries for Dynamic Databases. international conference on management of data. pp. 44- 53 ,(2000)
Jennifer Widom, Trio: A System for Integrated Management of Data, Accuracy, and Lineage conference on innovative data systems research. pp. 262- 276 ,(2004)
Hans-Peter Kriegel, Peter Kunath, Matthias Renz, Probabilistic Nearest-Neighbor Query on Uncertain Objects Advances in Databases: Concepts, Systems and Applications. pp. 337- 348 ,(2007) , 10.1007/978-3-540-71703-4_30
Ioana Stanoi, Amr El Abbadi, Mirek Riedewald, Divyakant Agrawal, Discovery of Influence Sets in Frequently Updated Databases very large data bases. pp. 99- 108 ,(2001)
Yufei Tao, Dimitris Papadias, Xiang Lian, Reverse kNN search in arbitrary dimensionality very large data bases. pp. 744- 755 ,(2004) , 10.1016/B978-012088469-8.50066-8
Tobias Emrich, Hans-Peter Kriegel, Peer Kröger, Matthias Renz, Andreas Züfle, Boosting spatial pruning Proceedings of the 2010 international conference on Management of data - SIGMOD '10. pp. 39- 50 ,(2010) , 10.1145/1807167.1807174
Yongxin Tong, Lei Chen, Bolin Ding, Discovering Threshold-based Frequent Closed Itemsets over Probabilistic Data 2012 IEEE 28th International Conference on Data Engineering. pp. 270- 281 ,(2012) , 10.1109/ICDE.2012.51
Reynold Cheng, Jinchuan Chen, Mohamed Mokbel, Chi-Yin Chow, Probabilistic Verifiers: Evaluating Constrained Nearest-Neighbor Queries over Uncertain Data international conference on data engineering. pp. 973- 982 ,(2008) , 10.1109/ICDE.2008.4497506
Reynold Cheng, Lei Chen, Jinchuan Chen, Xike Xie, Evaluating probability threshold k-nearest-neighbor queries over uncertain data Proceedings of the 12th International Conference on Extending Database Technology Advances in Database Technology - EDBT '09. pp. 672- 683 ,(2009) , 10.1145/1516360.1516438
Yongxin Tong, Lei Chen, Yurong Cheng, Philip S. Yu, Mining frequent itemsets over uncertain databases Proceedings of the VLDB Endowment. ,vol. 5, pp. 1650- 1661 ,(2012) , 10.14778/2350229.2350277