Classification performance of mathematical programming techniques in discriminant analysis: Results for small and medium sample sizes

作者: Antonie Stam , Dennis G. Jones

DOI: 10.1002/MDE.4090110406

关键词: MathematicsFunction (mathematics)Nonparametric statisticsLinear discriminant analysisQuadratic functionOptimal discriminant analysisSample size determinationParametric statisticsMathematical optimizationVariablesStatisticsManagement of Technology and InnovationManagement Science and Operations ResearchStrategy and ManagementBusiness and International Management

摘要: The performance on small and medium-size samples of several techniques to solve the classification problem in discriminant analysis is investigated. considered are two widely used parametric statistical (Fisher's linear function Smith's quadratic function), a class recently proposed nonparametric estimation based mathematical programming (linear mixed-integer programming). A simulation study performed, analyzing relative above two-group case, for various sample sizes, moderate group overlap across six different data conditions. Training as well validation assess classificatory techniques. degree sizes selected this paper interest practice because they closely reflect conditions many real sets. results experiment show that nonlinear tends be superior training when variances–covariances groups heterogeneous, while technique performs best equal, with equal variances discrete uniform independent variables. also found more sensitive than other size, giving disproportionally inaccurate samples.

参考文章(32)
Gary J. Koehler, S. Selcuk Erenguc, Minimizing Misclassifications in Linear Discriminant Analysis Decision Sciences. ,vol. 21, pp. 63- 85 ,(1990) , 10.1111/J.1540-5915.1990.TB00317.X
Carol A. Markowski, Edward P. Markowski, An experimental comparison of several approaches to the discriminant problem with both qualitative and quantitative variables European Journal of Operational Research. ,vol. 28, pp. 74- 78 ,(1987) , 10.1016/0377-2217(87)90170-6
Erich A. Joachimsthaler, Antonie Stam, FOUR APPROACHES TO THE CLASSIFICATION PROBLEM IN DISCRIMINANT ANALYSIS: AN EXPERIMENTAL STUDY* Decision Sciences. ,vol. 19, pp. 322- 333 ,(1988) , 10.1111/J.1540-5915.1988.TB00270.X
Tom M. Cavalier, James P. Ignizio, Allen L. Soyster, Discriminant analysis via mathematical programming: certain problems and their causes Computers & Operations Research. ,vol. 16, pp. 353- 362 ,(1989) , 10.1016/0305-0548(89)90007-5
R. J. McKay, N. A. Campbell, Variable selection techniques in discriminant analysis: II. Allocation British Journal of Mathematical and Statistical Psychology. ,vol. 35, pp. 30- 41 ,(1982) , 10.1111/J.2044-8317.1982.TB00639.X
R. A. FISHER, THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS Annals of Human Genetics. ,vol. 7, pp. 179- 188 ,(1936) , 10.1111/J.1469-1809.1936.TB02137.X
Ching-Tsao Tu, Chien-Pai Han, Discriminant analysis based on binary and continuous variables Journal of the American Statistical Association. ,vol. 77, pp. 447- 454 ,(1982) , 10.1080/01621459.1982.10477831
Robert S. Kaplan, Gabriel Urwitz, Statistical Models of Bond Ratings: A Methodological Inquiry The Journal of Business. ,vol. 52, pp. 231- 261 ,(1979) , 10.1086/296045