作者: G. D. C. Cavalcanti , T. I. Ren , B. A. Vale
关键词: Artificial intelligence 、 Machine learning 、 Large margin nearest neighbor 、 Pattern recognition 、 Data set 、 Classifier (UML) 、 Data complexity 、 Computer science 、 k-nearest neighbors algorithm
摘要: The classifier accuracy is affected by the properties of data sets used to train it. Nearest neighbor classifiers are known for being simple and accurate in several domains, but their behavior strongly dependent on complexity. On other hand, there complexity measures which aim describe sets. This work aims show how can be efficiently predict Neighbor classifier. Seven seventeen real datasets experimental study. Each measure analyzed individually order find a relationship between its value given dataset. No single good enough However, combination these provides powerful tool