Mining Heterogeneous Information Networks: Principles and Methodologies

作者: Jiawei Han , Yizhou Sun

DOI:

关键词: Link analysisData miningData scienceComputer scienceBig dataRelation (database)Heterogeneous networkNetwork modelComplex networkRankingCluster analysis

摘要: Real-world physical and abstract data objects are interconnected, forming gigantic, interconnected networks. By structuring these interactions between into multiple types, such networks become semi-structured heterogeneous information Most real-world applications that handle big data, including social media networks, scientific, engineering, or medical systems, online e-commerce most database can be structured Therefore, effective analysis of large-scale poses an interesting but critical challenge. In this book, we investigate the principles methodologies mining Departing from many existing network models view as homogeneous graphs our model leverages rich semantics typed nodes links in a uncovers surprisingly knowledge network. This modeling leads to series new powerful for including: (1) rank-based clustering classification; (2) meta-path-based similarity search mining; (3) relation strength-aware mining, other potential developments. book introduces research frontier points out some promising directions. Table Contents: Introduction / Ranking-Based Clustering Classification Heterogeneous Information Networks Meta-Path-Based Similarity Search Relationship Prediction Relation Strength-Aware with Incomplete Attributes User-Guided via Meta-Path Selection Research Frontiers

参考文章(94)
Free Press, Macmillan Company, Everett Mitchell Rogers, Diffusion of Innovations ,(1962)
Douglass R. Cutting, David R. Karger, Jan O. Pedersen, John W. Tukey, Scatter/Gather: a cluster-based approach to browsing large document collections international acm sigir conference on research and development in information retrieval. ,vol. 51, pp. 318- 329 ,(1992) , 10.1145/3130348.3130362
Ryan N. Lichtenwalter, Jake T. Lussier, Nitesh V. Chawla, New perspectives and methods in link prediction knowledge discovery and data mining. pp. 243- 252 ,(2010) , 10.1145/1835804.1835837
Albert-László Barabási, Réka Albert, Emergence of Scaling in Random Networks Science. ,vol. 286, pp. 509- 512 ,(1999) , 10.1126/SCIENCE.286.5439.509
Pável Calado, Marco Cristo, Edleno Moura, Nivio Ziviani, Berthier Ribeiro-Neto, Marcos André Gonçalves, Combining link-based and content-based methods for web document classification Proceedings of the twelfth international conference on Information and knowledge management - CIKM '03. pp. 394- 401 ,(2003) , 10.1145/956863.956938
Xiao Yu, Yizhou Sun, Peixiang Zhao, Jiawei Han, Query-driven discovery of semantically similar substructures in heterogeneous networks Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '12. pp. 1500- 1503 ,(2012) , 10.1145/2339530.2339765
Chi Wang, Jiawei Han, Yuntao Jia, Jie Tang, Duo Zhang, Yintao Yu, Jingyi Guo, Mining advisor-advisee relationships from research publication networks knowledge discovery and data mining. pp. 203- 212 ,(2010) , 10.1145/1835804.1835833
Leo Katz, A new status index derived from sociometric analysis Psychometrika. ,vol. 18, pp. 39- 43 ,(1953) , 10.1007/BF02289026
Kunal Punera, Joydeep Ghosh, CONSENSUS-BASED ENSEMBLES OF SOFT CLUSTERINGS Applied Artificial Intelligence. ,vol. 22, pp. 780- 810 ,(2008) , 10.1080/08839510802170546
M. E. J. Newman, Assortative mixing in networks. Physical Review Letters. ,vol. 89, pp. 208701- 208701 ,(2002) , 10.1103/PHYSREVLETT.89.208701