Utilizing offline clusters for realtime clustering of search results

作者: Srinivas Vadrevu , Alex J. Smola , Byron E. Dom , Choon Hui Teo , Suju Rajan

DOI:

关键词:

摘要: Techniques for clustering of search results are described. In an example embodiment, a plurality first clusters is determined, in corpus articles, independently user queries issued against the where each cluster represents group articles that relate to news story. One or more identifiers assigned article corpus, one respectively identify which belongs. A query specifies criteria received. response receiving query, result generated by at least selecting, from set based on criteria. The selected grouped into second articles. organized according clusters.

参考文章(8)
Raul E. Valdes-Perez, Christopher Robert Palmer, Andre dos Santos Lessa, Jerome Pesenti, Clustering system and method ,(2007)
Hung Chim, Xiaotie Deng, A new suffix tree similarity measure for document clustering the web conference. pp. 121- 130 ,(2007) , 10.1145/1242572.1242590
Arvind Arasu, Shriraghav Kaushik, Venkatesh Ganti, Disk-Based Probabilistic Set-Similarity Indexes ,(2007)
Kumar Hemachandra Chellapilla, Gregory T. Buehrer, Web graph compression through scalable pattern mining ,(2010)