Genetic semantic graph approach for multi-document abstractive summarization

作者: Atif Khan , Naomie Salim , Yogan Jaya Kumar

DOI: 10.1109/ICDIPC.2015.7323025

关键词:

摘要: The aim of automatic multi-document abstractive summarization is to create a compressed version the source text and preserves salient information. Existing graph based methods treat sentence as bag words, rely on content similarity measure did not consider semantic relationships between sentences. These may fail in determining redundant sentences that are semantically equivalent. This paper introduces genetic approach for summarization. Semantic from document set constructed such way nodes represent predicate argument structures (PASs), extracted automatically by employing role labeling (SRL); edges correspond weight determined PAS-to-PAS similarity, PAS-to-document relationship. relationship represented different features, weighted optimized algorithm. (PASs) ranked modified ranking In order reduce redundancy, we utilize maximal marginal relevance (MMR) re-ranks PASs use language generation generate summary top PASs. Experiment this study carried out using DUC-2002, standard corpus Experimental results reveal proposed performs better than other systems.

参考文章(36)
Günes Erkan, Dragomir R. Radev, LexPageRank: Prestige in Multi-Document Text Summarization empirical methods in natural language processing. pp. 365- 371 ,(2004)
Pierre-Etienne Genest, Guy Lapalme, Framework for Abstractive Summarization using Text-to-Text Generation meeting of the association for computational linguistics. pp. 64- 73 ,(2011)
Rada Mihalcea, Paul Tarau, A Language Independent Algorithm for Single and Multiple Document Summarization international joint conference on natural language processing. ,(2005)
Rajeev Motwani, Terry Winograd, Lawrence Page, Sergey Brin, The PageRank Citation Ranking : Bringing Order to the Web the web conference. ,vol. 98, pp. 161- 172 ,(1999)
Xiaojun Wan, Jianwu Yang, Improved affinity graph based multi-document summarization Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers on XX - NAACL '06. pp. 181- 184 ,(2006) , 10.3115/1614049.1614095
Furu Wei, Wenjie Li, Qin Lu, Yanxiang He, A document-sensitive graph model for multi-document summarization Knowledge and Information Systems. ,vol. 22, pp. 245- 259 ,(2010) , 10.1007/S10115-009-0194-2
Yi-Lun Shang, Multi-Type Directed Scale-Free Percolation Communications in Theoretical Physics. ,vol. 57, pp. 701- 716 ,(2012) , 10.1088/0253-6102/57/4/26
Atif Khan, Naomie Salim, Yogan Jaya Kumar, A framework for multi-document abstractive summarization based on semantic role labelling soft computing. ,vol. 30, pp. 737- 747 ,(2015) , 10.1016/J.ASOC.2015.01.070
Mustafa Yavaş, Gönenç Yücel, Impact of Homophily on Diffusion Dynamics Over Social Networks Social Science Computer Review. ,vol. 32, pp. 354- 372 ,(2014) , 10.1177/0894439313512464
Hideki Tanaka, Akinori Kinoshita, Takeshi Kobayakawa, Tadashi Kumano, Naoto Kato, Syntax-Driven Sentence Revision for Broadcast News Summarization Proceedings of the 2009 Workshop on Language Generation and Summarisation (UCNLG+Sum 2009). pp. 39- 47 ,(2009) , 10.3115/1708155.1708163