Model based approach for household clustering with mixed scale variables

作者: Luis Nieto-Barajas , Christian Carmona , Antonio Canale

DOI:

关键词: Social changeSample (statistics)Cluster analysisComputer scienceSampling (statistics)Scale (social sciences)Mixture modelPopulationPovertyStatisticsEconometrics

摘要: The Ministry of Social Development in Mexico is charge creating and assigning social programmes targeting specific needs the population for improvement quality life. To better target programmes, aimed to find clusters households with same based on demographic characteristics as well poverty conditions household. Available data consists continuous, ordinal, nominal variables observations are not iid but come from a survey sample complex design. We propose Bayesian nonparametric mixture model that jointly models this mixed scale accommodates different sampling probabilities. performance assessed via simulated data. A full analysis socio-economic State presented.

参考文章(28)
Zoubin Ghahramani, Sara Wade, Bayesian Cluster Analysis: Point Estimation and Credible Balls Bayesian Analysis. ,vol. 13, pp. 47- 49 ,(2017) , 10.1214/17-BA1073
Robert McCulloch, Xiao Li Meng, John Barnard, Modeling covariance matrices in terms of standard deviations and correlations, with application to shrinkage Statistica Sinica. ,vol. 10, pp. 1281- 1311 ,(2000)
Antonio Canale, Bruno Scarpa, Bayesian nonparametric location–scale–shape mixtures Test. ,vol. 25, pp. 113- 130 ,(2016) , 10.1007/S11749-015-0446-2
LG Leon-Novelo, X Zhou, B Nebiyou Bekele, P Müller, None, Assessing toxicities in a clinical trial: Bayesian inference for ordinal data nested within categories Biometrics. ,vol. 66, pp. 966- 974 ,(2010) , 10.1111/J.1541-0420.2009.01359.X
Antonio Canale, David B. Dunson, Bayesian Kernel Mixtures for Counts Journal of the American Statistical Association. ,vol. 106, pp. 1528- 1539 ,(2011) , 10.1198/JASA.2011.TM10552
Geoffrey J. McLachlan, Kaye E. Basford, Mixture models : inference and applications to clustering Statistics: Textbooks and Monographs. ,vol. 84, ,(1988)
Jing-Heng Cai, Xin-Yuan Song, Kwok-Hap Lam, Edward Hak-Sing Ip, A mixture of generalized latent variable models for mixed mode and heterogeneous data Computational Statistics & Data Analysis. ,vol. 55, pp. 2889- 2907 ,(2011) , 10.1016/J.CSDA.2011.05.011
Luis E. Nieto-Barajas, Alberto Contreras-Cristán, A Bayesian Nonparametric Approach for Time Series Clustering Bayesian Analysis. ,vol. 9, pp. 147- 170 ,(2014) , 10.1214/13-BA852
Christian Ritter, Martin A. Tanner, Facilitating the Gibbs Sampler: The Gibbs Stopper and the Griddy-Gibbs Sampler Journal of the American Statistical Association. ,vol. 87, pp. 861- 868 ,(1992) , 10.1080/01621459.1992.10475289
D. Fernández, R. Arnold, S. Pledger, Mixture-based clustering for the ordered stereotype model Computational Statistics & Data Analysis. ,vol. 93, pp. 46- 75 ,(2016) , 10.1016/J.CSDA.2014.11.004