Mining molecular fragments: finding relevant substructures of molecules

作者: C. Borgelt , M.R. Berthold

DOI: 10.1109/ICDM.2002.1183885

关键词:

摘要: We present an algorithm to find fragments in a set of molecules that help discriminate between different classes for instance, activity drug discovery context. Instead carrying out brute-force search, our method generates by embedding them all appropriate parallel and prunes the search tree based on local order atoms bonds, which results substantially faster eliminating need frequent, computationally expensive reembeddings suppressing redundant search. prove usefulness demonstrating activity-related groups chemical compounds well-known National Cancer Institute's HIV-screening dataset.

参考文章(10)
Robert D. Clark, Relative and absolute diversity analysis of combinatorial libraries Ghose, Arup K [Editor, Reprint Author], Viswanadhan, Vellarkad N [Editor, Reprint Author] Combinatorial library design and evaluation. pp. 337- 362 ,(2001)
Srinivasan Parthasarathy, Mitsunori Ogihara, Mohammed J Zaki, Wei Li, New algorithms for fast discovery of association rules knowledge discovery and data mining. pp. 283- 286 ,(1997)
Christian Borgelt, Rudolf Kruse, Induction of Association Rules: Apriori Implementation COMPSTAT. pp. 395- 400 ,(2002) , 10.1007/978-3-642-57489-4_59
Jochen Hipp, Andreas Myka, Rüdiger Wirth, Ulrich Güntzer, A new algorithm for faster mining of generalized association rules Principles of Data Mining and Knowledge Discovery. pp. 74- 82 ,(1998) , 10.1007/BFB0094807
Owen S. Weislow, Rebecca Kiser, Donald L. Fine, John Bader, Robert H. Shoemaker, Michael R. Boyd, New Soluble-Formazan Assay for HIV-1 Cytopathic Effects: Application to High-Flux Screening of Synthetic and Natural Products for AIDS-Antiviral Activity Journal of the National Cancer Institute. ,vol. 81, pp. 577- 586 ,(1989) , 10.1093/JNCI/81.8.577
Stefan Kramer, Luc De Raedt, Christoph Helma, Molecular feature mining in HIV data knowledge discovery and data mining. pp. 136- 143 ,(2001) , 10.1145/502512.502533
John W. Raymond, Eleanor J. Gardiner, Peter Willett, Heuristics for similarity searching of chemical graphs using a maximum common edge subgraph algorithm Journal of Chemical Information and Computer Sciences. ,vol. 42, pp. 305- 316 ,(2002) , 10.1021/CI010381F
Wolf Jürgen Streich, Rainer Franke, Topological Pharmacophores New Methods and Their Application to a Set of Antimalarials Part 1: The Methods LOGANA and LOCON Quantitative Structure-activity Relationships. ,vol. 4, pp. 13- 18 ,(1985) , 10.1002/QSAR.19850040104
Mukund Deshpande, Michihiro Kuramochi, George Karypis, Automated approaches for classifying structures international conference on data mining. pp. 11- 18 ,(2002) , 10.21236/ADA439498
Rakesh Agrawal, Tomasz Imieliński, Arun Swami, Mining association rules between sets of items in large databases Proceedings of the 1993 ACM SIGMOD international conference on Management of data - SIGMOD '93. ,vol. 22, pp. 207- 216 ,(1993) , 10.1145/170035.170072