Discovering associations in XML data

作者: A. Meisels , M. Orlov , T. Maor

DOI: 10.1109/WISEW.2002.1177861

关键词: Probability distributionXml dataAssociation rule learningData contentXMLXML validationComputer scienceInferenceData mining

摘要: Knowledge inference from semi-structured data can utilize frequent sub structures, in addition to frequency of items.In fact, the working assumption present study is that sub-trees XML represent sets tags (objects) aremeaningfully associated. A method for extracting presented. It uses thresholds on frequenciesof paths and multiplicity data. The are extracted counted a procedure has O(n2) complexity. content sub-trees, form attribute values, cast tabular form. This enables search forassociations Thus, complete structure extract association rules semi-structureddata. large industrial example used demonstrate operation proposed method.

参考文章(11)
Kohei Maruyama, Kuniaki Uehara, Mining Association Rules from Semi-Structured Data. workshop on knowledge discovery and data mining. ,(2000)
Wai-ching Wong, Ada Wai-Chee Fu, Finding Structure and Characteristics of Web Documents for Classification. international conference on management of data. pp. 96- 105 ,(2000)
Ramakrishnan Srikant, Rakesh Agrawal, Fast algorithms for mining association rules very large data bases. pp. 580- 592 ,(1998)
Ramakrishnan Srikant, Rakesh Agrawal, Fast Algorithms for Mining Association Rules in Large Databases very large data bases. pp. 487- 499 ,(1994)
Kohei Maruyama, Kuniaki Uehara, Knowledge Integration of Rule Mining and Schema Discovering discovery science. pp. 285- 289 ,(2000) , 10.1007/3-540-44418-1_31
Ke Wang, Huiqing Liu, Discovering typical structures of documents: a road map approach international acm sigir conference on research and development in information retrieval. pp. 146- 154 ,(1998) , 10.1145/290941.290982
Ming-Syan Chen, Jong Soo Park, P.S. Yu, Efficient data mining for path traversal patterns IEEE Transactions on Knowledge and Data Engineering. ,vol. 10, pp. 209- 221 ,(1998) , 10.1109/69.683753
P. A. Laur, F. Masseglia, P. Poncelet, M. Teisseire, A General Architecture for Finding Structural Regularities on the Web artificial intelligence methodology systems applications. pp. 179- 188 ,(2000) , 10.1007/3-540-45331-8_17
Ke Wang, Huiqing Liu, Discovering structural association of semistructured data IEEE Transactions on Knowledge and Data Engineering. ,vol. 12, pp. 353- 371 ,(2000) , 10.1109/69.846290
Rakesh Agrawal, Tomasz Imieliński, Arun Swami, Mining association rules between sets of items in large databases Proceedings of the 1993 ACM SIGMOD international conference on Management of data - SIGMOD '93. ,vol. 22, pp. 207- 216 ,(1993) , 10.1145/170035.170072