Method and apparatus for generating time-series data from web pages

Authors: Shigeaki Sakurai, Norihiko Sawa

Abstract: According to one embodiment, Web pages that match a user's designated collection condition are collected from a plurality of sites. The Web pages are divided into clusters, based on URL information of the Web pages. A date expression is extracted from the Web pages included in each of the clusters. A typical form is determined for the date expression. The Web pages included in each of the clusters are divided into items, based on the typical form. The items are sorted in order of time, based on the date expressions corresponding to the items. Time-series data is generated by sorting the items.
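
The pipeline outlined in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation; it only mirrors the stated steps under simplifying assumptions. Pages are assumed to be already collected as (URL, text) pairs, clustering uses only the host name of the URL, the candidate date forms are limited to the two hypothetical patterns in `DATE_FORMS`, and the "typical form" is taken to be the most frequent pattern in a cluster. All function names are illustrative.

```python
import re
from collections import Counter, defaultdict
from datetime import datetime
from urllib.parse import urlparse

# Hypothetical candidate date forms: name -> (regex, strptime format).
DATE_FORMS = {
    "YYYY-MM-DD": (re.compile(r"\d{4}-\d{2}-\d{2}"), "%Y-%m-%d"),
    "YYYY/MM/DD": (re.compile(r"\d{4}/\d{2}/\d{2}"), "%Y/%m/%d"),
}


def cluster_by_url(pages):
    """Group page texts by the host name of their URL (an assumed criterion)."""
    clusters = defaultdict(list)
    for url, text in pages:
        clusters[urlparse(url).netloc].append(text)
    return dict(clusters)


def typical_form(texts):
    """Return the name of the most frequent date form in a cluster, or None."""
    counts = Counter()
    for text in texts:
        for name, (pattern, _) in DATE_FORMS.items():
            counts[name] += len(pattern.findall(text))
    if not counts or max(counts.values()) == 0:
        return None
    return counts.most_common(1)[0][0]


def split_into_items(text, form):
    """Cut a page at each date in the typical form; return (date, text) items."""
    pattern, fmt = DATE_FORMS[form]
    matches = list(pattern.finditer(text))
    items = []
    for i, match in enumerate(matches):
        end = matches[i + 1].start() if i + 1 < len(matches) else len(text)
        try:
            when = datetime.strptime(match.group(), fmt)
        except ValueError:
            continue  # Matches the pattern but is not a valid calendar date.
        items.append((when, text[match.end():end].strip()))
    return items


def build_time_series(pages):
    """Cluster pages, split them into dated items, and sort the items by time."""
    series = []
    for texts in cluster_by_url(pages).values():
        form = typical_form(texts)
        if form is None:
            continue  # No recognizable date expression in this cluster.
        for text in texts:
            series.extend(split_into_items(text, form))
    series.sort(key=lambda item: item[0])
    return series


if __name__ == "__main__":
    sample_pages = [
        ("http://a.example/news",
         "2008-03-02 Product B released. 2008-01-15 Product A released."),
        ("http://b.example/log", "2008/02/10 Site opened."),
    ]
    for when, body in build_time_series(sample_pages):
        print(when.date().isoformat(), body)
```

Determining one typical form per cluster, rather than per page, reflects the idea that pages from the same site tend to share a layout, so the dominant date pattern is a plausible item delimiter for every page in that cluster.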
