Semi-Supervised Classification Method Based on Spectral Clustering

作者: Xi Chen

DOI: 10.4304/JNW.9.2.384-392

关键词:

摘要: With the rapid development of data collection and storage technology, there are plentiful unlabeled but very few often expensive labeled in real-word applications. Thus, semi-supervised learning algorithms have attracted much attention. In this paper, we propose a new classification algorithm benefiting from spectral clustering called SC-SSL. First, introduce to partition all into clusters. Second, build classifier using predict probabilities (weights) classes that each instance belongs for cluster. Third, cluster, add those instances whose labels with maximum weights as same cluster label data. Fourth, terms set, reconstruct classifier. We repeat above processing steps 2 3 till meeting stopping condition. Finally, extensive experiments reveal our SC-SSL can sufficiently use information get robust by clustering, it maintains higher accuracy compared several well known algorithms.

参考文章(28)
Akinori Fujino, Naonori Ueda, Kazumi Saito, A hybrid generative/discriminative approach to semi-supervised classifier design national conference on artificial intelligence. pp. 764- 769 ,(2005)
Leonard E. Trigg, Mark A. Hall, Ian H. Witten, Eibe Frank, Sally Jo Cunningham, Geoffrey Holmes, Weka: Practical machine learning tools and techniques with Java implementations ,(1999)
Yu-Feng Li, Zhi-Hua Zhou, Improving semi-supervised support vector machines through unlabeled instances selection national conference on artificial intelligence. pp. 386- 391 ,(2011)
Kurt Driessens, Peter Reutemann, Bernhard Pfahringer, Claire Leschi, Using Weighted Nearest Neighbor to Benefit from Unlabeled Data pacific-asia conference on knowledge discovery and data mining. ,vol. 3918, pp. 60- 69 ,(2006) , 10.1007/11731139_10
Yangqiu Song, Wen-Yen Chen, Hongjie Bai, Chih-Jen Lin, Edward Y. Chang, Parallel Spectral Clustering european conference on machine learning. pp. 374- 389 ,(2008) , 10.1007/978-3-540-87481-2_25
Sotiris B. Kotsiantis, Supervised Machine Learning: A Review of Classification Techniques Informatica (lithuanian Academy of Sciences). ,vol. 31, pp. 249- 268 ,(2007)
Roman Filipovych, Christos Davatzikos, Alzheimer's Disease Neuroimaging Initiative, Semi-supervised pattern classification of medical images: Application to mild cognitive impairment (MCI) NeuroImage. ,vol. 55, pp. 1109- 1119 ,(2011) , 10.1016/J.NEUROIMAGE.2010.12.066
Chotirat "ann" Ratanamahatana, Dimitrios Gunopulos, Feature selection for the naive bayesian classifier using decision trees Applied Artificial Intelligence. ,vol. 17, pp. 475- 487 ,(2003) , 10.1080/713827175
王玲, 薄列峰, 焦李成, None, Density-Sensitive Semi-Supervised Spectral Clustering Journal of Software. ,vol. 18, pp. 2412- ,(2007) , 10.1360/JOS182412
Fanhua Shang, Yuanyuan Liu, Fei Wang, Learning Spectral Embedding for Semi-supervised Clustering international conference on data mining. pp. 597- 606 ,(2011) , 10.1109/ICDM.2011.89