A social graph based text mining framework for chat log investigation

作者: Tarique Anwar , Muhammad Abulaish

DOI: 10.1016/J.DIIN.2014.10.001

关键词:

摘要: This paper presents a unified social graph based text mining framework to identify digital evidences from chat logs data. It considers both users' conversation and interaction data in group-chats discover overlapping interests their ties. The proposed applies n-gram technique association with self-customized hyperlink-induced topic search (HITS) algorithm key-terms representing interests, key-users, key-sessions. We propose generation model interactions, where ties (edges) between pair of users (nodes) are established only if they participate at least one common group-chat session, weights assigned the on degree overlap interactions. Finally, we present three possible cyber-crime investigation scenarios user-group identification method for each them. our experimental results set comprising 1100 11,143 sessions continued over period 29 months January 2010 May 2012. Experimental suggest that is able key-terms, key-sessions, user-groups data, all which crucial investigation. Though recovered single computer, it very likely collected multiple computers real scenario. In this case, can be combined together generate more enriched graph. However, experiments show objectives achieved even computer by using draw relationships every users.

参考文章(37)
Jason Bengel, Susan Gauch, Eera Mittur, Rajan Vijayaraghavan, ChatTrack: Chat room topic detection using classification intelligence and security informatics. pp. 266- 277 ,(2004) , 10.1007/978-3-540-25952-7_20
Yu Xiao, Jian Yu, Partitive clustering ( K -means family) Wiley Interdisciplinary Reviews-Data Mining and Knowledge Discovery. ,vol. 2, pp. 209- 225 ,(2012) , 10.1002/WIDM.1049
Ahmad Kamal, Muhammad Abulaish, Tarique Anwar, Mining feature-opinion pairs and their reliability scores from web opinion sources web intelligence, mining and semantics. pp. 15- ,(2012) , 10.1145/2254129.2254150
A. K. Jain, M. N. Murty, P. J. Flynn, Data clustering: a review ACM Computing Surveys. ,vol. 31, pp. 264- 323 ,(1999) , 10.1145/331499.331504
Muhammad Abulaish, Tarique Anwar, A web content mining approach for tag cloud generation Proceedings of the 13th International Conference on Information Integration and Web-based Applications and Services - iiWAS '11. pp. 52- 59 ,(2011) , 10.1145/2095536.2095548
Tarique Anwar, Muhammad Abulaish, Identifying cliques in dark web forums - An agglomerative clustering approach intelligence and security informatics. pp. 171- 173 ,(2012) , 10.1109/ISI.2012.6284289
Tayfun Kucukyilmaz, B. Barla Cambazoglu, Cevdet Aykanat, Fazli Can, Chat mining: Predicting user and message attributes in computer-mediated communication Information Processing & Management. ,vol. 44, pp. 1448- 1466 ,(2008) , 10.1016/J.IPM.2007.12.009
Yeha Lee, Hun-young Jung, Woosang Song, Jong-Hyeok Lee, Mining the blogosphere for top news stories identification international acm sigir conference on research and development in information retrieval. pp. 395- 402 ,(2010) , 10.1145/1835449.1835516
Muhammad Abulaish, Tarique Anwar, A Keyphrase-Based Tag Cloud Generation Framework to Conceptualize Textual Data International Journal of Adaptive, Resilient and Autonomic Systems. ,vol. 4, pp. 72- 93 ,(2013) , 10.4018/JARAS.2013040104