Data mining methodologies for supporting engineers during system identification

作者: Sandro Saitta

DOI: 10.5075/EPFL-THESIS-4056

关键词: Feature selectionSet (abstract data type)Artificial intelligenceSystem identificationMachine learningComputer scienceData setData stream miningCluster analysisData collectionData miningDecision support system

摘要: Data alone are worth almost nothing. While data collection is increasing exponentially worldwide, a clear distinction between retrieving and obtaining knowledge has to be made. retrieved while measuring phenomena or gathering facts. Knowledge refers patterns trends that useful for decision making. interpretation creates challenge particularly present in system identification, where thousands of models may explain given set measurements. Manually interpreting such not reliable. One solution use mining. This thesis thus proposes an integration techniques from mining, field research the aim find data, into existing multiple-model identification methodology. It shown that, within framework support, mining constitute valuable tool engineers performing identification. For example, clustering group similar together order guide subsequent decisions since they might indicate possible states structure. A main issue concerns number clusters, which, usually, unknown. determining correct clusters estimating quality algorithm, score function proposed. The reliable index set, understanding results. Furthermore, information who perform achieved through feature selection techniques. They allow relevant parameters candidate models. core algorithm strategy based on global search. In addition providing about model space, found supporting related sensor placement. When integrated methodology iterative placement, provide support rational basis placement structures. Greedy search strategies should selected according context. Experiments show whereas more efficient initial greedy suitable

参考文章(189)
Asa Ben-Hur, Andre Elisseeff, Isabelle Guyon, A stability based method for discovering structure in clustered data. pacific symposium on biocomputing. pp. 6- 17 ,(2001) , 10.1142/9789812799623_0002
Mohamed Bouguessa, Shengrui Wang, Haojun Sun, An objective approach to cluster validation Pattern Recognition Letters. ,vol. 27, pp. 1419- 1430 ,(2006) , 10.1016/J.PATREC.2006.01.015
Costas Papadimitriou, James L. Beck, Siu-Kui Au, Entropy-Based Optimal Sensor Location for Structural Model Updating Journal of Vibration and Control. ,vol. 6, pp. 781- 800 ,(2000) , 10.1177/107754630000600508
Y. Robert-Nicoud, B. Raphael, O. Burdet, I. F. C. Smith, Model Identification of Bridges Using Measurement Data Computer-aided Civil and Infrastructure Engineering. ,vol. 20, pp. 118- 131 ,(2005) , 10.1111/J.1467-8667.2005.00381.X
Dale W. Jorgenson, Jerald Hunter, M. Ishag Nadiri, The Predictive Performance of Econometric Models of Quarterly Investment Behavior Econometrica. ,vol. 38, pp. 213- 224 ,(1970) , 10.2307/1913004
Lotfi A. Zadeh, Applied Soft Computing – Foreword Applied Soft Computing. ,vol. 1, pp. 1- 2 ,(2001) , 10.1016/S1568-4946(01)00003-5
P. Pudil, J. Novovičová, J. Kittler, Floating search methods in feature selection Pattern Recognition Letters. ,vol. 15, pp. 1119- 1125 ,(1994) , 10.1016/0167-8655(94)90127-9
Ian F. Smith, Sandro Saitta, Improving Knowledge of Structural System Behavior through Multiple Models Journal of Structural Engineering-asce. ,vol. 134, pp. 553- 561 ,(2008) , 10.1061/(ASCE)0733-9445(2008)134:4(553)