Tuple-Independent Representations of Infinite Probabilistic Databases.

作者: Martin Grohe , Peter Lindner , Nofar Carmeli , Christoph Standke

DOI:

关键词:

摘要: Probabilistic databases (PDBs) are probability spaces over database instances. They provide a framework for handling uncertainty in databases, as occurs due to data integration, noisy data, from unreliable sources or randomized processes. Most of the existing theory literature investigated finite, tuple-independent PDBs (TI-PDBs) where occurrences tuples independent events. Only recently, Grohe and Lindner (PODS '19) introduced independence assumptions beyond finite domain assumption. In major argument discussing theoretical properties TI-PDBs is that they can be used represent any PDB via views. This no longer case once number countably infinite. this paper, we systematically study representability infinite terms related block-independent disjoint PDBs. The central question which representable first-order views We give necessary condition sufficient criterion distribution PDB. With various examples, explore limits our criteria. show conditioning on first order yields additional power expressivity. Finally, discuss relation between purely logical arithmetic reasons (non-)representability.

参考文章(43)
Bhaskara Marthi, Brian Milch, Daniel L. Ong, Andrey Kolobov, Stuart Russell, David Sontag, BLOG: probabilistic models with unknown objects international joint conference on artificial intelligence. pp. 1352- 1359 ,(2005)
Parag Singla, Pedro Domingos, Markov logic in infinite domains uncertainty in artificial intelligence. pp. 368- 375 ,(2007)
Nir Friedman, Daniel L. Koller, Probabilistic graphical models : principles and techniques The MIT Press. ,(2009)
Jennifer Widom, Parag Agrawal, Continuous Uncertainty in Trio. MUD. pp. 17- 32 ,(2009)
Victor Vianu, Serge Abiteboul, Richard Hull, Foundations of databases ,(1994)
Bernd Gutmann, Manfred Jaeger, Luc De Raedt, Extending ProbLog with Continuous Distributions Inductive Logic Programming. ,vol. 6489, pp. 76- 91 ,(2011) , 10.1007/978-3-642-21295-6_12
Lise Getoor, Prithviraj Sen, Amol Deshpande, PrDB: managing and exploiting rich correlations in probabilistic databases very large data bases. ,vol. 18, pp. 1065- 1090 ,(2009) , 10.1007/S00778-009-0153-2
Tomasz Imieliński, Witold Lipski, Incomplete Information in Relational Databases Journal of the ACM. ,vol. 31, pp. 761- 791 ,(1984) , 10.1145/1634.1886
Serge Abiteboul, T.-H. HUBERT Chan, Evgeny Kharlamov, Werner Nutt, Pierre Senellart, Capturing continuous data and answering aggregate queries in probabilistic XML ACM Transactions on Database Systems. ,vol. 36, pp. 1- 45 ,(2011) , 10.1145/2043652.2043658