Investigation of effect of reducing dataset's size on classification algorithms

作者: Neelam Singhal , Mohd. Ashraf

DOI:

关键词:

摘要: Data mining is now one of the most active field research. Extracting those nuggets information becoming crucial and its important technique classification. It helps to group data in some predefined classes. Various techniques for classification exists which classifies using different algorithms. Each algorithm has own area best worst performance. This paper concentrates on four famous algorithms, i.e., Decision Tree, Naive Bayes, K Nearest Neighbour Genetic Programming effect their performance time accuracy when number instances are incrementally decreased. will also investigate difference result working with binary class or multiclass datasets suggest algorithms follow certain kind dataset.

参考文章(2)
Radhika Kotecha, Vijay Ukani, Sanjay Garg, An empirical analysis of multiclass classification techniques in data mining nirma university international conference on engineering. pp. 1- 5 ,(2011) , 10.1109/NUICONE.2011.6153244
Mark A. Hall, Ian H. Witten, Eibe Frank, Christopher J. Pal, Data Mining, Fourth Edition: Practical Machine Learning Tools and Techniques Morgan Kaufmann Publishers Inc.. ,(2016)