Bootstrapping training-data representations for inductive learning: a case study in molecular biology

作者: Nathalie Japkowicz , Haym Hirsh

DOI:

关键词:

摘要: This paper describes a "bootstrapping" approach to the engineering of appropriate training-data representations for inductive learning. The central idea is begin with an initial set human-created features and then generate additional that have syntactic forms are similar human-engineered features. More specifically, we describe two-stage process good learning: first, generating by hand (usually in consultation domain experts) seem help learning, second, off these developing applying operators new look syntactically like expert-based Our experiments DNA sequence identification show successful representation data can be expanded this fashion yield dramatically improved results

参考文章(12)
James P. Callan, Paul E. Utgoff, Constructive induction on domain information national conference on artificial intelligence. pp. 614- 619 ,(1991)
Tom E. Fawcett, Paul E. Utgoff, Automatic Feature Generation for Problem Solving Systems international conference on machine learning. pp. 144- 153 ,(1992) , 10.1016/B978-1-55860-247-2.50024-3
Sholom M. Weiss, Computer systems that learn ,(1990)
J. G. Carbonell, T. M. Mitchell, R. S. Michalski, Machine Learning: An Artificial Intelligence Approach Springer Publishing Company, Incorporated. ,(2013)
Peter Clark, Robin Boswell, Rule induction with CN2: Some recent improvements Lecture Notes in Computer Science. pp. 151- 163 ,(1991) , 10.1007/BFB0017011
Ryszard S. Michalski, A theory and methodology of inductive learning Computer Compacts. ,vol. 1, pp. 49- ,(1983) , 10.1016/0167-7136(83)90132-4
John Current, Hasan Pirkul, Theory and methodology European Journal of Operational Research. ,vol. 51, pp. 338- 347 ,(1991) , 10.1016/0377-2217(91)90309-J
David B. Searls, The computational linguistics of biological sequences Artificial intelligence and molecular biology. pp. 47- 120 ,(1993)
H. Hirsh, M. Noordewier, Using background knowledge to improve inductive learning of DNA sequences conference on artificial intelligence for applications. pp. 351- 357 ,(1994) , 10.1109/CAIA.1994.323654
J. Ross Quinlan, C4.5: Programs for Machine Learning ,(1992)