Authors: Jungi Kim, In-Su Kang, Jong-Hyeok Lee
DOI: 10.1007/11940098_22
Keywords:
Abstract: A patent collection provides a great test-bed for cluster-based information retrieval. The International Patent Classification (IPC) system provides a hierarchical taxonomy with 5 levels of specificity. We regard the IPC codes of patent applications as cluster information, manually assigned by patent officers according to their subjects. Such manual clustering has advantages over automatically built clusters based on document term similarities. Previous studies have successfully applied cluster-based retrieval models using language modeling. We develop cluster-based language models that employ the advantages of having manually clustered documents.