Uncertainty Annotated Databases - A Lightweight Approach for Approximating Certain Answers

作者: Su Feng , Aaron Huber , Boris Glavic , Oliver Kennedy

DOI: 10.1145/3299869.3319887

关键词:

摘要: Certain answers are a principled method for coping with uncertainty that arises in many practical data management tasks. Unfortunately, this is expensive and may ex- clude useful (if uncertain) answers. Thus, users frequently resort to less approaches resolve uncertainty. In paper, we propose Uncertainty Annotated Databases (UA-DBs), which combine an under- over-approximation of certain achieve the reliability answers, performance classical database system. Furthermore, contrast prior work on UA-DBs higher utility by including some (explicitly marked) not certain. based incomplete K-relations, introduce generalize set-based notion databases much larger class models. Using implementation our approach, demonstrate experimentally it efficiently produces tight approximations high utility.

参考文章(42)
Val Tannen, Todd J. Green, Models for Incomplete and Probabilistic Information. IEEE Data(base) Engineering Bulletin. ,vol. 29, pp. 17- 24 ,(2006)
Ariel D. Fuxman, Renée J. Miller, First-Order Query Rewriting for Inconsistent Databases Database Theory - ICDT 2005. pp. 337- 351 ,(2004) , 10.1007/978-3-540-30570-5_23
Jef Wijsen, On the first-order expressibility of computing certain answers to conjunctive queries over uncertain databases Proceedings of the twenty-ninth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems of data - PODS '10. pp. 179- 190 ,(2010) , 10.1145/1807085.1807111
Robert Fink, Andrew Hogue, Dan Olteanu, Swaroop Rath, SPROUT2 Proceedings of the 2011 international conference on Management of data - SIGMOD '11. pp. 1299- 1302 ,(2011) , 10.1145/1989323.1989481
T. Imielinski, R. Vandermeyden, K.V. Vadaparty, Complexity tailored design: a new design methodology for databases with incomplete information symposium on the theory of computing. ,vol. 51, pp. 405- 432 ,(1995) , 10.1006/JCSS.1995.1079
Tomasz Imieliński, Witold Lipski, Incomplete Information in Relational Databases Journal of the ACM. ,vol. 31, pp. 761- 791 ,(1984) , 10.1145/1634.1886
Xu Chu, John Morcos, Ihab F. Ilyas, Mourad Ouzzani, Paolo Papotti, Nan Tang, Yin Ye, KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing international conference on management of data. pp. 1247- 1261 ,(2015) , 10.1145/2723372.2749431
Phokion G. Kolaitis, Enela Pema, A dichotomy in the complexity of consistent query answering for queries with two atoms Information Processing Letters. ,vol. 112, pp. 77- 85 ,(2012) , 10.1016/J.IPL.2011.10.018
Robert Fink, Jiewen Huang, Dan Olteanu, Anytime approximation in probabilistic databases very large data bases. ,vol. 22, pp. 823- 848 ,(2013) , 10.1007/S00778-013-0310-5
Egor V. Kostylev, Peter Buneman, Combining dependent annotations for relational algebra international conference on database theory. pp. 196- 207 ,(2012) , 10.1145/2274576.2274597