作者: Zheng-Feng LI , Guang-Jin XU , Jia-Jun WANG , Guo-Rong DU , Wen-Sheng CAI
DOI: 10.1016/S1872-2040(16)60907-6
关键词: Outlier 、 Chemistry 、 Artificial intelligence 、 Partial least squares regression 、 Anomaly detection 、 Mean squared error 、 Robust regression 、 Pattern recognition 、 Calibration (statistics) 、 Orange juice 、 Cross-validation
摘要: Abstract Outlier detection is an important task in multivariate calibration because the quality of a model determined by that data. An outlier method was proposed for near infrared (NIR) spectral analysis. The based on definition and principle partial least squares (PLS) regression, i.e., dataset behaved differently from rest, prediction result PLS accumulation several independent latent variables. Therefore, built with dataset, then contribution each variable investigated. Outliers were detected comparing these contributions. NIR orange juice samples adopted testing method. Six outliers set. root mean squared error cross validation (RMSECV) reduced 16.870 to 4.809 (RMSEP) 3.688 3.332 after removal outliers. Compared robust regression method, seemed more reasonable.