Text Summarization Based on Genetic Programming

作者: Hamid Khosravi , Pooya Khosraviyan Dehkordi , Farshad Kumarci

DOI:

关键词:

摘要: This work proposes an approach to address the problem of improving content selection in automatic text summarization by using some statistical tools. is a trainable summarizer, which takes into account several features, for each sentence generate summaries. First, we investigate effect feature on task. Then use all features combination train genetic programming (GP), vector and fuzzy order construct summarizer model. Furthermore, trained models test performance. The proposed performance measured at compression rates data corpus composed 17 English scientific articles.

参考文章(13)
Satoshi Sekine, Chikashi Nobata, Sentence Extraction with Information Extraction technique. In: the Document Understanding Conference; 2001.. ,(2001)
Joel Larocca Neto, Alex A. Freitas, Celso A. A. Kaestner, Automatic Text Summarization Using a Machine Learning Approach brazilian symposium on artificial intelligence. pp. 205- 215 ,(2002) , 10.1007/3-540-36127-8_20
H. P. Luhn, The automatic creation of literature abstracts Ibm Journal of Research and Development. ,vol. 2, pp. 159- 165 ,(1958) , 10.1147/RD.22.0159
Gerard Salton, Christopher Buckley, Term Weighting Approaches in Automatic Text Retrieval Information Processing and Management. ,vol. 24, pp. 323- 328 ,(1988) , 10.1016/0306-4573(88)90021-0
Jen-Yuan Yeh, Hao-Ren Ke, Wei-Pang Yang, I-Heng Meng, Text summarization using a trainable summarizer and latent semantic analysis Information Processing and Management. ,vol. 41, pp. 75- 95 ,(2005) , 10.1016/J.IPM.2004.04.003
Sanda Harabagiu, Andrew Hickl, Finley Lacatusu, Satisfying information needs with multi-document summaries Information Processing & Management. ,vol. 43, pp. 1619- 1642 ,(2007) , 10.1016/J.IPM.2007.01.004
David Zajic, Bonnie J. Dorr, Jimmy Lin, Richard Schwartz, Multi-candidate reduction: Sentence compression as a tool for document summarization tasks Information Processing & Management. ,vol. 43, pp. 1549- 1570 ,(2007) , 10.1016/J.IPM.2007.01.016
Tadashi Nomoto, Discriminative sentence compression with conditional random fields Information Processing & Management. ,vol. 43, pp. 1571- 1587 ,(2007) , 10.1016/J.IPM.2007.01.025
Tsutomu Hirao, Manabu Okumura, Norihito Yasuda, Hideki Isozaki, Supervised automatic evaluation for summarization with voted regression model Information Processing & Management. ,vol. 43, pp. 1521- 1535 ,(2007) , 10.1016/J.IPM.2007.01.012
Marie-Francine Moens, Summarizing court decisions Information Processing & Management. ,vol. 43, pp. 1748- 1764 ,(2007) , 10.1016/J.IPM.2007.01.005