Why off-the-shelf RDBMSs are better at XPath than you might expect

作者: Torsten Grust , Jan Rittinger , Jens Teubner

DOI: 10.1145/1247480.1247591

关键词:

摘要: To compensate for the inherent impedance mismatch between relational data model (tables of tuples) and XML (ordered, unranked trees), tree join algorithms have become prevalent means to process in databases, most notably TwigStack[6], structural join[1], staircase join[13] algorithms. However, addition these existing systems depends on a significant invasion underlying database kernel, an option intolerable vendors. Here, we demonstrate that can achieve comparable XPath performance without touching heart system. We carefully exploit functionality accelerate navigation by purely means: partitioned B-trees bring access costs secondary storage minimum, while aggregation functions avoid expensive computation removal duplicate result nodes comply with semantics. Experiments carried out IBM DB2 confirm our approach turn off-the-shelf into efficient processors.

参考文章(22)
Jennifer Widom, Jason McHugh, Query Optimization for XML very large data bases. pp. 315- 326 ,(1999)
Shankar Pal, Istvan Cseri, Oliver Seeliger, Gideon Schaller, Leo Giakoumakis, Vasili Zolotov, Indexing XML data stored in a relational database very large data bases. pp. 1146- 1157 ,(2004) , 10.1016/B978-012088469-8.50099-1
Chun Zhang, Jeffrey Naughton, David DeWitt, Qiong Luo, Guy Lohman, On supporting containment queries in relational database management systems international conference on management of data. ,vol. 30, pp. 425- 436 ,(2001) , 10.1145/375663.375722
Rudolf Bayer, Karl Unterauer, Prefix B-trees ACM Transactions on Database Systems. ,vol. 2, pp. 11- 26 ,(1977) , 10.1145/320521.320530
Nicolas Bruno, Nick Koudas, Divesh Srivastava, Holistic twig joins Proceedings of the 2002 ACM SIGMOD international conference on Management of data - SIGMOD '02. pp. 310- 321 ,(2002) , 10.1145/564691.564727
Torsten Grust, Manuel Mayr, Jan Rittinger, Sherif Sakr, Jens Teubner, A SQL Proceedings of the 2007 ACM SIGMOD international conference on Management of data - SIGMOD '07. pp. 1162- 1164 ,(2007) , 10.1145/1247480.1247642
Igor Tatarinov, Stratis D. Viglas, Kevin Beyer, Jayavel Shanmugasundaram, Eugene Shekita, Chun Zhang, Storing and querying ordered XML using a relational database system Proceedings of the 2002 ACM SIGMOD international conference on Management of data - SIGMOD '02. pp. 204- 215 ,(2002) , 10.1145/564691.564715
Quanzhong Li, Bongki Moon, Indexing and Querying XML Data for Regular Path Expressions very large data bases. pp. 361- 370 ,(2001)
Patrick O'Neil, Elizabeth O'Neil, Shankar Pal, Istvan Cseri, Gideon Schaller, Nigel Westbury, ORDPATHs Proceedings of the 2004 ACM SIGMOD international conference on Management of data - SIGMOD '04. pp. 903- 908 ,(2004) , 10.1145/1007568.1007686