Transforming Heterogeneous Data with Database Middleware: Beyond Integration.

Authors: Edward L. Wimmers, Renée J. Miller, Peter M. Schwarz, Mary Tork Roth, Laura M. Haas

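Abstract: Many applications today need information from diverse data sources, in which related data may be represented quite differently. In one common scenario, a DBA wants to add data from a new source to an existing warehouse. The data in the new source may not match the existing warehouse schema. The new data may also be partially redundant with that in the existing warehouse, or formatted differently. Other applications may need to integrate data more dynamically, in response to user queries. Even applications using data from a single source often want to present it in a form other than that it is stored in. For example, a user may want to publish some information using a particular XML DTD, though the data is not stored in that form. In each of these scenarios, one or more data sets must be mapped into a single target representation. Needed transformations may include schema transformations (changing the structure of the data) [BLN86, RR98] and data transformation and cleansing (changing the format and vocabulary of the data and eliminating or at least reducing duplicates and errors) [Val, ETI, ME97, HS95]. In each area, there is a broad range of possible transformations, from simple to complex. Schema and data transformations have typically been studied separately. We believe they need to be handled together via a uniform mechanism. Database middleware systems [PGMW95, TRV96, ACPS96, Bon95] integrate data from multiple sources. To be effective, such systems must provide one or more integrated schemas, and must be able to transform data from different sources to answer queries against these schemas. The power of their query engines and their ability to connect to several information sources makes them a natural base for doing more complex transformations as well. In this paper, we look at database middleware systems as transformation engines, and discuss when and how data is transformed to provide users with the information they need.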

References (19)
Sudha Ram, V. Ramesh. Schema integration: past, present, and future. Management of Heterogeneous and Autonomous Database Systems, pp. 119-155 (1998).
Steven Geffner, Divakant Agrawal, Amr El Abbadi. The Dynamic Data Cube. Extending Database Technology, pp. 237-253 (2000). DOI: 10.1007/3-540-46439-5_17
Fereidoon Sadri, Laks V. S. Lakshmanan, Iyer N. Subramanian. SchemaSQL - A Language for Interoperability in Relational Multi-Database Systems. Very Large Data Bases, pp. 239-250 (1996).
Peter M. Schwarz, Mary Tork Roth. Don't Scrap It, Wrap It! A Wrapper Architecture for Legacy Data Sources. Very Large Data Bases, pp. 266-275 (1997).
Kenneth A. Ross. Relations with relation names as arguments. Proceedings of the eleventh ACM SIGACT-SIGMOD-SIGART symposium on Principles of database systems - PODS '92, pp. 346-353 (1992). DOI: 10.1145/137097.137905
Mauricio A. Hernández, Salvatore J. Stolfo. The merge/purge problem for large databases. International Conference on Management of Data, vol. 24, pp. 127-138 (1995). DOI: 10.1145/223784.223807
S. Geffner, D. Agrawal, A. El Abbadi, T. Smith. Relative prefix sums: an efficient approach for querying dynamic OLAP data cubes. International Conference on Data Engineering, pp. 328-335 (1999). DOI: 10.1109/ICDE.1999.754948
William W. Cohen. Integration of heterogeneous databases without common domains using queries based on textual similarity. Proceedings of the 1998 ACM SIGMOD international conference on Management of data - SIGMOD '98, vol. 27, pp. 201-212 (1998). DOI: 10.1145/276304.276323
Ching-Tien Ho, Rakesh Agrawal, Nimrod Megiddo, Ramakrishnan Srikant. Range queries in OLAP data cubes. International Conference on Management of Data, vol. 26, pp. 73-88 (1997). DOI: 10.1145/253260.253274