On discovering co-location patterns in datasets: a case study of pollutants and child cancers

作者: Jundong Li , Aibek Adilmagambetov , Mohomed Shazan Mohomed Jabbar , Osmar R Zaïane , Alvaro Osornio-Vargas

DOI: 10.1007/S10707-016-0254-1

关键词:

摘要: We intend to identify relationships between cancer cases and pollutant emissions by proposing a novel co-location mining algorithm. In this context, we specifically attempt understand whether there is relationship the location of child diagnosed with any chemical combinations emitted from various facilities in that particular location. Co-location pattern intends detect sets spatial features frequently located close proximity each other. Most previous works domain are based on transaction-free apriori-like algorithms which dependent user-defined thresholds, designed for boolean data points. Due absence clear notion transactions, it nontrivial use association rule techniques tackle problem. Our proposed approach focused grid transactionization? geographic space, mine datasets extended objects. It also capable incorporating uncertainty existence model real world scenarios more accurately. eliminate necessity using global threshold introducing statistical test validate significance candidate patterns rules. Experiments both synthetic reveal our algorithm can considerable amount statistically significant patterns. addition, explain modelling framework used pollutants (PRTR/NPRI) childhood cases.

参考文章(37)
Hui Xiong, Shashi Shekhar, Yan Huang, Vipin Kumar, Xiaobin Ma, Jin Soung Yoc, A framework for discovering co-location patterns in data sets with extended spatial objects siam international conference on data mining. pp. 78- 89 ,(2004) , 10.1137/1.9781611972740.8
Sajib Barua, Jörg Sander, SSCP: mining statistically significant co-location patterns symposium on large spatial databases. pp. 2- 20 ,(2011) , 10.1007/978-3-642-22922-0_2
Aibek Adilmagambetov, Osmar R. Zaiane, Alvaro Osornio-Vargas, Discovering Co-location Patterns in Datasets with Extended Spatial Objects data warehousing and knowledge discovery. pp. 84- 96 ,(2013) , 10.1007/978-3-642-40131-2_8
Jundong Li, Osmar R. Zaïane, Alvaro Osornio-Vargas, Discovering Statistically Significant Co-location Rules in Datasets with Extended Spatial Objects data warehousing and knowledge discovery. pp. 124- 135 ,(2014) , 10.1007/978-3-319-10160-6_12
Jian Pei, Jiawei Han, Hongjun Lu, Shojiro Nishio, Shiwei Tang, Dongqing Yang, H-mine: hyper-structure mining of frequent patterns in large databases international conference on data mining. pp. 441- 448 ,(2001) , 10.1109/ICDM.2001.989550
Ramakrishnan Srikant, Rakesh Agrawal, Fast Algorithms for Mining Association Rules in Large Databases very large data bases. pp. 487- 499 ,(1994)
Shashi Shekhar, Yan Huang, Discovering Spatial Co-location Patterns: A Summary of Results symposium on large spatial databases. pp. 236- 256 ,(2001) , 10.1007/3-540-47724-1_13
Vladimir Estivill-Castrol, Alan T. Murray, Discovering Associations in Spatial Data - An Efficient Medoid Based Approach knowledge discovery and data mining. ,vol. 1394, pp. 110- 121 ,(1998) , 10.1007/3-540-64383-4_10