Qualitative Aspects in Numerical Data: Structure, Interaction, and Correlation

作者: Aleks Jakulin

DOI:

关键词: DiscretizationMachine learningApplied mathematicsArtificial intelligenceProbabilistic logicStatistical modelFinite mixtureData structureEntropy (information theory)MathematicsCorrelationContinuous data

摘要: Entropy and information are common measures of probabilistic models data, frequently used for discrete discretized but more rarely continuous data. We employ finite mixture models, which handle data simultaneously, to construct a model whose entropy can be estimated. Analytic estimates intractable some so an approximate sample estimate is used. A simplified formulation bootstrap employed assess the distribution entropy, then represented with confidence intervals. show how as unified approach quantifying three fundamental qualitative aspects models: correlation aspect that corresponds linearities in model, structure helps capture model’s nonlinearities, interaction underlying entanglements attributes resulting from or both.

参考文章(17)
Trevor Hastie, Y. Dan Rubinstein, Discriminative vs informative learning knowledge discovery and data mining. pp. 49- 53 ,(1997)
Wray Buntine, Variational Extensions to EM and Multinomial PCA Lecture Notes in Computer Science. pp. 23- 34 ,(2002) , 10.1007/3-540-36755-1_3
Aleks Jakulin, Ivan Bratko, Analyzing Attribute Dependencies european conference on principles of data mining and knowledge discovery. pp. 229- 240 ,(2003) , 10.1007/978-3-540-39804-2_22
Ricardo Vilalta, Irina Rish, A decomposition of classes via clustering to explain and improve Naive Bayes european conference on machine learning. pp. 444- 455 ,(2003) , 10.1007/978-3-540-39857-8_40
Lynette Hunt, Murray Jorgensen, Mixture model clustering using the MULTIMIX program Australian & New Zealand Journal of Statistics. ,vol. 41, pp. 154- 171 ,(1999) , 10.1111/1467-842X.00071
J McLachlan, G, D. Peel, Finite Mixture Models ,(2000)
1Dorian Šuc, Ivan Bratko, Induction of Qualitative Trees european conference on machine learning. pp. 442- 453 ,(2001) , 10.1007/3-540-44795-4_38
A. Philip Dawid, Peter D. Grünwald, Game theory, maximum entropy, minimum discrepancy and robust Bayesian decision theory Annals of Statistics. ,vol. 32, pp. 1367- 1433 ,(2004) , 10.1214/009053604000000553
Gregory F. Cooper, Stefano Monti, A Bayesian network classifier that combines a finite mixture model and a naïve bayes model uncertainty in artificial intelligence. pp. 447- 456 ,(1999)