Reconciliation of outliers in CO2-alkanolamine-H2O datasets by robust neural network winsorization

作者: Humbul Suleman , Abdulhalim Shah Maulud , Zakaria Man

DOI: 10.1007/S00521-016-2213-Z

关键词: Data miningArtificial neural networkRobust statisticsOutlierIdentification (information)Computer scienceWinsorized meanNonlinear system

摘要: It is normal to find at least a few measured values in CO2-alkanolamine-H2O datasets that deviate greatly from the majority of published data, as data come different sources. These values, termed outliers, are major source conflict modeling, simulation and process development studies. Therefore, removal outliers mandatory. However, available statistical techniques known lose information boundaries system exhibit substantial deviation holistic trend. Hence, an adaptive approach combining artificial neural networks robust winsorization presented for identification reconciliation system. The proposed flexibly transforms nonlinear distribution predicts corrected (winsorized values), thus maintaining extremes results have been graphically analyzed show good conformance treated with retention winsorized values. method improves shortcomings previous approaches can be potentially extended other experimental chemical systems.

参考文章(82)
Barry J. Wythoff, Backpropagation neural networks Chemometrics and Intelligent Laboratory Systems. ,vol. 18, pp. 115- 155 ,(1993) , 10.1016/0169-7439(93)80052-J
B. Lemoine, Yi-Gui Li, R. Cadours, C. Bouallou, D. Richon, Partial vapor pressure of CO2 and H2S over aqueous methyldiethanolamine solutions Fluid Phase Equilibria. ,vol. 172, pp. 261- 277 ,(2000) , 10.1016/S0378-3812(00)00383-6
P. C. Mahalanobis, On the generalized distance in statistics Proceedings of the National Institute of Sciences (Calcutta). ,vol. 2, pp. 49- 55 ,(1936)
James Franklin, The elements of statistical learning : data mining, inference,and prediction The Mathematical Intelligencer. ,vol. 27, pp. 83- 85 ,(2005) , 10.1007/BF02985802
Richard A Olshen, Charles J Stone, Leo Breiman, Jerome H Friedman, Classification and regression trees ,(1983)
Lennart Eriksson, Johan Trygg, Svante Wold, A chemometrics toolbox based on projections and latent variables Journal of Chemometrics. ,vol. 28, pp. 332- 346 ,(2014) , 10.1002/CEM.2581
Sadam Alwadi, EXISTING OUTLIER VALUES IN FINANCIAL DATA VIA WAVELET TRANSFORM European scientific journal. ,vol. 11, ,(2015)
J. I. Lee, F. D. Otto, A. E. Mather, The solubility of H2S and CO2 in aqueous monoethanolamine solutions Canadian Journal of Chemical Engineering. ,vol. 52, pp. 803- 805 ,(1974) , 10.1002/CJCE.5450520617
G. Kateman, Neural networks in analytical chemistry Chemometrics and Intelligent Laboratory Systems. ,vol. 19, pp. 135- 142 ,(1993) , 10.1016/0169-7439(93)80097-2