作者: YLLIAS CHALI , SADID A. HASAN
DOI: 10.1017/S1351324911000167
关键词: Semantic similarity 、 Artificial intelligence 、 Semi-supervised learning 、 Computer science 、 Machine learning 、 Support vector machine 、 Natural language processing 、 Automatic summarization 、 Supervised learning 、 Multi-document summarization 、 Conditional random field 、 Similarity measure
摘要: In this paper, we apply different supervised learning techniques to build query-focused multi-document summarization systems, where the task is produce automatic summaries in response a given query or specific information request stated by user. A huge amount of labeled data prerequisite for training. It expensive and time-consuming when humans perform labeling manually. Automatic can be good remedy problem. We employ five annotation extracts from human abstracts using ROUGE, Basic Element overlap, syntactic similarity measure, semantic Extended String Subsequence Kernel. The methods use are Support Vector Machines, Conditional Random Fields, Hidden Markov Models, Maximum Entropy, two ensemble-based approaches. During experiments, analyze impact on performance applied methods. To our knowledge, no other study has deeply investigated compared effects approaches domain summarization.