Opinion Summarization of Bangla Texts using Cosine Simillarity Based Graph Ranking and Relevance Based Approach

作者: Shofi Ullah , Sagar Hossain , K. M. Azharul Hasan

DOI: 10.1109/ICBSLP47725.2019.201494

关键词:

摘要: The main idea of the automatic extractive text or opinion summarization is to find most important representative small subset original document without any loss information. There are many existing methods available for English, Turkish, Arabic and other languages. But very few attempts has been done Bangla language because its having rich morphology multifaceted structure. In this paper, we propose a joint cosine simillarity based graph ranking Relevance scoring approach bangla text. We developed stemming algorithm on Parts Speech(POS) tagging consisting around two lakhs POS tags texts. A redundancy removal also proposed remove so that each sentences in summary represents exactly information document. performance evaluated by measuring recall, precision f-score Rouge metric it showed outperforms

参考文章(13)
Jianguo Xiao, Xiaojun Wan, Single document keyphrase extraction using neighborhood knowledge national conference on artificial intelligence. pp. 855- 860 ,(2008)
Kamal Sarkar, Bengali text summarization by sentence extraction arXiv: Information Retrieval. ,(2012)
Atif Khan, Naomie Salim, Yogan Jaya Kumar, Genetic semantic graph approach for multi-document abstractive summarization international conference on digital information processing and communications. pp. 173- 181 ,(2015) , 10.1109/ICDIPC.2015.7323025
Kamal Sarkar, A Keyphrase-Based Approach to Text Summarization for English and Bengali Documents International Journal of Technology Diffusion. ,vol. 5, pp. 28- 38 ,(2014) , 10.4018/IJTD.2014040103
Kamal Sarkar, An approach to summarizing Bengali news documents advances in computing and communications. pp. 857- 862 ,(2012) , 10.1145/2345396.2345535
Md Iftekharul Alam Efat, Mohammad Ibrahim, Humayun Kayesh, None, Automated Bangla text summarization by sentence scoring and ranking international conference on informatics electronics and vision. pp. 1- 5 ,(2013) , 10.1109/ICIEV.2013.6572686
Rejwanul Haque, Sudip Kumar Naskar, Andy Way, Marta R. Costa-jussa, Rafael E. Banchs, Sentence Similarity-Based Source Context Modelling in PBSMT international conference on asian language processing. pp. 257- 260 ,(2010) , 10.1109/IALP.2010.45
Chin-Yew Lin, ROUGE: A Package for Automatic Evaluation of Summaries meeting of the association for computational linguistics. pp. 74- 81 ,(2004)
Alfirna Rizqi Lahitani, Adhistya Erna Permanasari, Noor Akhmad Setiawan, Cosine similarity to determine similarity measure: Study case in online essay assessment 2016 4th International Conference on Cyber and IT Service Management. pp. 1- 6 ,(2016) , 10.1109/CITSM.2016.7577578
Md. Majharul Haque, Suraiya Pervin, Zerina Begum, Enhancement of keyphrase-based approach of automatic Bangla text summarization ieee region 10 conference. pp. 42- 46 ,(2016) , 10.1109/TENCON.2016.7847955