作者: Claudio Carpineto , Stanislaw Osiński , Giovanni Romano , Dawid Weiss
关键词:
摘要: Web clustering engines organize search results by topic, thus offering a complementary view to the flat-ranked list returned conventional engines. In this survey, we discuss issues that must be addressed in development of engine, including acquisition and preprocessing results, their visualization. Search clustering, core system, has specific requirements cannot classical algorithms. We emphasize role played quality cluster labels as opposed optimizing only structure. highlight main characteristics number existing also how evaluate retrieval performance. Some directions for future research are finally presented.