Robust multiple-instance learning ensembles using random subspace instance selection

作者: Marc-André Carbonneau , Eric Granger , Alexandre J. Raymond , Ghyslain Gagnon

DOI: 10.1016/J.PATCOG.2016.03.035

关键词:

摘要: Many real-world pattern recognition problems can be modeled using multiple-instance learning (MIL), where instances are grouped into bags, and each bag is assigned a label. State-of-the-art MIL methods provide high level of performance when strong assumptions made regarding the underlying data distributions, proportion positive to negative in bags. In this paper, new method called Random Subspace Instance Selection (RSIS) proposed for robust design ensembles without any prior on structure First, instance selection probabilities computed based training clustered random subspaces. A pool classifiers then generated subsets created with these probabilities. By RSIS, more many distributions noise, not adversely affected by bags because repeatedly selected probabilistic manner. Moreover, RSIS also allows identification an individual basis, as required practical applications. Results obtained several synthetic databases show robustness designed over range witness rates, noisy features compared reference literature. HighlightsA method, Selection, ensembles.The yields that variations rate, noise.The state-of-the-art results benchmark sets.

参考文章(52)
Luc De Raedt, Jan Ramon, Multi instance neural networks international conference on machine learning. pp. 53- 60 ,(2000)
Ken Lang, NewsWeeder: Learning to Filter Netnews Machine Learning Proceedings 1995. pp. 331- 339 ,(1995) , 10.1016/B978-1-55860-377-6.50048-7
Foster Provost, R Fawcett, T, Kohavi, The Case against Accuracy Estimation for Comparing Induction Algorithms international conference on machine learning. pp. 445- 453 ,(1998)
Zhi-Hua Zhou, Min-Ling Zhang, Ensembles of multi-instance learners european conference on machine learning. pp. 492- 502 ,(2003) , 10.1007/978-3-540-39857-8_44
Peter A. Flach, Thomas Gärtner, Alex J. Smola, Adam Kowalczyk, Multi-Instance Kernels international conference on machine learning. pp. 179- 186 ,(2002)
Nils Weidmann, Eibe Frank, Bernhard Pfahringer, A two-level learning method for generalized multi-instance problems european conference on machine learning. pp. 468- 479 ,(2003) , 10.1007/978-3-540-39857-8_42
Charles X. Ling, Jin Huang, Harry Zhang, AUC: a better measure than accuracy in comparing learning algorithms Lecture Notes in Computer Science. pp. 329- 341 ,(2003) , 10.1007/3-540-44886-1_25
Jason Fritts, Sally A. Goldman, Qi Zhang, Wei Yu, Content-Based Image Retrieval Using Multiple-Instance Learning international conference on machine learning. pp. 682- 689 ,(2002)
Xin Xu, Eibe Frank, Logistic Regression and Boosting for Labeled Bags of Instances Advances in Knowledge Discovery and Data Mining. pp. 272- 281 ,(2004) , 10.1007/978-3-540-24775-3_35
Peter V. Gehler, Olivier Chapelle, Deterministic Annealing for Multiple-Instance Learning international conference on artificial intelligence and statistics. pp. 123- 130 ,(2007)