Collaborative data sharing with mappings and provenance

作者: Zachary G. Ives , Val Tannen , Todd J. Green

DOI:

关键词: Relational databaseInformation retrievalMaterialized viewWorld Wide WebData modelingQuery optimizationData exchangeXMLComputer scienceData sharingData integration

摘要: A key challenge in science today involves integrating data from databases managed by different collaborating scientists. In this dissertation, we develop the foundations and applications of collaborative sharing systems (CDSSs), which address challenge. CDSS allows collaborators to define loose confederations heterogeneous databases, relating them through schema mappings that establish how should flow one site next. addition simply propagating along mappings, it is critical record provenance (annotations describing where originated) support policies allowing scientists specify whose they trust, when. Since a large confederation certain evolve over time, must also efficiently handle incremental changes data, schemas, mappings. We focus dissertation on formal CDSSs, as well practical issues its implementation prototype called Orchestra. We propose novel model appropriate for based framework semiring-annotated relations. This elegantly generalizes number other important database semantics involving annotated relations, including ranked results, prior models, probabilistic databases. describe design Orchestra prototype, supports update propagation across while maintaining filtering according trust policies. investigate fundamental questions query containment equivalence context information. use results these investigations approaches CDSS. Our highlight unexpected connections between two problems with problem optimizing queries using materialized views. Finally, show semiring annotations make sense XML nested relational paving way towards future extension richer models.

参考文章(129)
Zachary G. Ives, Val Tannen, Grigorios Karvounarakis, Provenance in collaborative data sharing University of Pennsylvania. ,(2009)
Fangqing Dong, Laks V. S. Lakshmanan, Deductive Databases with Incomplete Information. JICSLP. pp. 303- 317 ,(1992)
Peter Buneman, Val Tannen, A Structural Approach to Query Language Design The Functional Approach to Data Management. pp. 335- 367 ,(2004) , 10.1007/978-3-662-05372-0_14
Yehoshua Sagiv, Alberto O. Mendelzon, Divesh Srivastava, Alon Y. Levy, Answering Queries Using Views. symposium on principles of database systems. pp. 95- 104 ,(1995)
Fereidoon Sadri, Laks V. S. Lakshmanan, Probabilistic deductive databases international conference on logic programming. pp. 254- 268 ,(1994)
Jaroslav Nešetřil, Pavol Hell, Graphs and homomorphisms ,(2004)
Gösta Grahne, Nicolas Spyratos, Daniel Stamate, Semantics and containment of queries with internal and external conjunctions international conference on database theory. pp. 71- 82 ,(1997) , 10.1007/3-540-62222-5_37
Inderpal Singh Mumick, Query optimization in deductive and relational databases Stanford University. ,(1992)
Anastasios Kementsietsidis, Marcelo Arenas, Data sharing through query translation in autonomous sources very large data bases. pp. 468- 479 ,(2004) , 10.1016/B978-012088469-8.50043-7