Authors: Craig G. Nevill-Manning, Gordon W. Paynter, Carl Gutwin, Ian H. Witten, Eibe Frank
DOI:
Keywords:
Abstract: Keyphrases are an important means of document summarization, clustering, and topic search. Only a small minority of documents have author-assigned keyphrases, and manually assigning keyphrases to existing documents is very laborious. Therefore it is highly desirable to automate the keyphrase extraction process. This paper shows that a simple procedure for keyphrase extraction based on the naive Bayes learning scheme performs comparably to the state of the art. It goes on to explain how this procedure's performance can be boosted by automatically tailoring the extraction process to the particular document collection at hand. Results on a large collection of technical reports in computer science show that the quality of the extracted keyphrases improves significantly when domain-specific information is exploited.