Optimal column subset selection for image classification by genetic algorithms

作者: Pavel Krömer , Jan Platoš , Jana Nowaková , Václav Snášel

DOI: 10.1007/S10479-016-2331-0

关键词:

摘要: Many problems in operations research can be solved by combinatorial optimization. Fixed-length subset selection is a family of optimization that involve set unique objects from larger superset. Feature selection, p-median problem, and column problem are three examples hard search for fixed-length subsets. Due to their high complexity, exact algorithms often infeasible solve real-world instances these approximate methods based on various heuristic metaheuristic (e.g. nature-inspired) approaches employed. Selecting subsets massive data matrices an important technique useful construction compressed representations low rank approximations high-dimensional data. Search optimal exactly k columns matrix, \(A^{m\times n}\), \(k < n\), well-known with practical implications processing mining. It used unsupervised feature dimensionality reduction, visualization, so on. A representation raw contribute, example, reduction algorithm training times supervised learning, elimination overfitting classification regression, facilitation better understanding, many other benefits. This paper proposes novel genetic the evaluates it series computational experiments image classification. The evaluation shows proposed modifications improve results obtained artificial evolution.

参考文章(39)
Annie S. Wu, Robert K. Lindsay, Rick L. Riolo, Empirical Observations on the Roles of Crossover and Mutation. ICGA. pp. 362- 369 ,(1997)
Christos Boutsidis, Anastasios Zouzias, Petros Drineas, Michael W. Mahoney, Stochastic Dimensionality Reduction for K-means Clustering ,(2011)
Andrew Czarn, Cara MacNish, Kaipillil Vijayan, Berwin Turlach, Statistical exploratory analysis of genetic algorithms: the influence of gray codes upon the difficulty of a problem australasian joint conference on artificial intelligence. pp. 1246- 1252 ,(2004) , 10.1007/978-3-540-30549-1_130
Ali Ghodsi, Mohamed S. Kamel, Ahmed K. Farahat, A Fast Greedy Algorithm for Generalized Column Subset Selection. arXiv: Data Structures and Algorithms. ,(2013)
James Franklin, The elements of statistical learning : data mining, inference,and prediction The Mathematical Intelligencer. ,vol. 27, pp. 83- 85 ,(2005) , 10.1007/BF02985802
Christos Boutsidis, Petros Drineas, Michael W. Mahoney, An improved approximation algorithm for the column subset selection problem symposium on discrete algorithms. pp. 968- 977 ,(2009) , 10.5555/1496770.1496875
A. Çivril, Column Subset Selection Problem is UG-hard Journal of Computer and System Sciences. ,vol. 80, pp. 849- 859 ,(2014) , 10.1016/J.JCSS.2014.01.004
Christos Boutsidis, Malik Magdon-Ismail, A note on sparse least-squares regression Information Processing Letters. ,vol. 114, pp. 273- 276 ,(2014) , 10.1016/J.IPL.2013.11.011