Diagnosis of colorectal cancer by near-infrared optical fiber spectroscopy and random forest

作者: Hui Chen , Zan Lin , Hegang Wu , Li Wang , Tong Wu

DOI: 10.1016/J.SAA.2014.07.005

关键词: ChemometricsSpectroscopyAnalytical chemistryCluster analysisNear-infrared spectroscopyArtificial intelligenceRandom forestIn situPrincipal component analysisFourier transformChemistryPattern recognitionInstrumentation (computer programming)Atomic and Molecular Physics, and Optics

摘要: Near-infrared (NIR) spectroscopy has such advantages as being noninvasive, fast, relatively inexpensive, and no risk of ionizing radiation. Differences in the NIR signals can reflect many physiological changes, which are turn associated with factors vascularization, cellularity, oxygen consumption, or remodeling. spectral differences between colorectal cancer healthy tissues were investigated. A Fourier transform instrument equipped a fiber-optic probe was used to mimic situ clinical measurements. total 186 spectra collected then underwent preprocessing standard normalize variate (SNV) for removing unwanted background variances. All specimen spots collection confirmed staining examination by an experienced pathologist so ensure representative pathology. Principal component analysis (PCA) uncover possible clustering. Several methods including random forest (RF), partial least squares-discriminant (PLSDA), K-nearest neighbor classification regression tree (CART) extract features construct diagnostic models. By comparison, it reveals that, even if obvious difference misclassified ratio (MCR) observed these models, RF is preferable since quicker, more convenient insensitive over-fitting. The results indicate that coupled model serve potential tool discriminating from normal ones.

参考文章(29)
Davide Ballabio, Viviana Consonni, Classification tools in chemistry. Part 1: linear models. PLS-DA Analytical Methods. ,vol. 5, pp. 3790- 3798 ,(2013) , 10.1039/C3AY40582F
Liwen Liang, Bin Wang, Ye Guo, Hong Ni, Yulin Ren, A support vector machine-based analysis method with wavelet denoised near-infrared spectroscopy Vibrational Spectroscopy. ,vol. 49, pp. 274- 277 ,(2009) , 10.1016/J.VIBSPEC.2008.10.008
Venkata Radhakrishna Kondepati, Michael Keese, Ralf Mueller, Bernd Christoph Manegold, Juergen Backhaus, Application of near-infrared spectroscopy for the diagnosis of colorectal cancer in resected human tissue specimens Vibrational Spectroscopy. ,vol. 44, pp. 236- 242 ,(2007) , 10.1016/J.VIBSPEC.2006.12.001
Vladimir Svetnik, Andy Liaw, Christopher Tong, J. Christopher Culberson, Robert P. Sheridan, Bradley P. Feuston, Random forest: a classification and regression tool for compound classification and QSAR modeling. Journal of Chemical Information and Computer Sciences. ,vol. 43, pp. 1947- 1958 ,(2003) , 10.1021/CI034160G
Alexey Tsymbal, Mykola Pechenizkiy, Pádraig Cunningham, Diversity in search strategies for ensemble feature selection Information Fusion. ,vol. 6, pp. 83- 98 ,(2005) , 10.1016/J.INFFUS.2004.04.003
Xiaoqing Zhang, Yizhuang Xu, Yuanfu Zhang, Lixin Wang, Chunsheng Hou, Xiaosi Zhou, Xiaofeng Ling, Zhi Xu, None, Intraoperative Detection of Thyroid Carcinoma by Fourier Transform Infrared Spectrometry Journal of Surgical Research. ,vol. 171, pp. 650- 656 ,(2011) , 10.1016/J.JSS.2010.05.031
Wei-song Yi, Dian-sheng Cui, Zhi Li, Lan-lan Wu, Ai-guo Shen, Ji-ming Hu, Gastric cancer differentiation using Fourier transform near-infrared spectroscopy with unsupervised pattern recognition. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy. ,vol. 101, pp. 127- 131 ,(2013) , 10.1016/J.SAA.2012.09.037
Chao Tan, Xin Qin, Menglong Li, An ensemble method based on a self-organizing map for near-infrared spectral calibration of complex beverage samples. Analytical and Bioanalytical Chemistry. ,vol. 392, pp. 515- 521 ,(2008) , 10.1007/S00216-008-2280-9