Substructure discovery using minimum description length and background knowledge

作者: D. J. Cook , L. B. Holder

DOI: 10.1613/JAIR.43

关键词: MathematicsMinimum description lengthMeasure (mathematics)SubstructureComponent (UML)Domain (software engineering)Molecule miningVariety (cybernetics)Data miningCloseness

摘要: The ability to identify interesting and repetitive substructures is an essential component discovering knowledge in structural data. We describe a new version of our SUBDUE substructure discovery system based on the minimum description length principle. discovers that compress original data represent concepts By replacing previously-discovered data, multiple passes produce hierarchical regularities uses computationally-bounded inexact graph match identifies similar, but not identical, instances finds approximate measure closeness two when under computational constraints. In addition minimumdescription principle, other background can be used by guide search towards more appropriate substructures. Experiments variety domains demonstrate SUBDUE's find capable compressing discover important domain.

参考文章(26)
KEVIN THOMPSON, PAT LANGLEY, Concept formation in structured domains Concept formation knowledge and experience in unsupervised learning. pp. 127- 161 ,(1991) , 10.1016/B978-1-4832-0773-5.50011-0
Edwin P. D. Pednault, Some experiments in applying inductive inference principles to surface reconstruction international joint conference on artificial intelligence. pp. 1603- 1609 ,(1989)
Stephen C.-Y. Lu, R. Bharat Rao, Learning engineering models with the minimum description length principle national conference on artificial intelligence. pp. 717- 722 ,(1992)
Robert Levinson, A self-organizing retrieval system for graphs national conference on artificial intelligence. pp. 203- 206 ,(1984)
Eric Jeltsch, Hans-Jörg Kreowski, Grammatical Inference Based on Hyperedge Replacement international workshop on graph grammars and their application to computer science. pp. 461- 474 ,(1990) , 10.1007/BFB0017406
Patrick H. Winston, Learning Structural Descriptions From Examples The Psychology of Computer Vision. ,(1970)
Darrell Conklin, Janice Glasgow, Spatial analogy and subsumption international conference on machine learning. pp. 111- 116 ,(1992) , 10.1016/B978-1-55860-247-2.50019-X
Ryszard S. Michalski, Robert E. Stepp, Learning from Observation: Conceptual Clustering Machine Learning. pp. 331- 363 ,(1983) , 10.1007/978-3-662-12405-5_11