Authors: Natalie S. Glance, Matthew Hurst, Takashi Tomokiyo
DOI:
Keywords:
Abstract: Over the past few years, weblogs have emerged as a new communication and publication medium on the Internet. In this paper, we describe the application of data mining, information extraction, and NLP algorithms for discovering trends across our subset of approximately 100,000 weblogs. We publish daily lists of key persons, key phrases, and key paragraphs to a public web site, BlogPulse.com. In addition, we maintain a searchable index of weblog entries. On top of the search index, we have implemented trend search, which graphs a normalized trend line over time for a query and provides a way to estimate the relative buzz of word of mouth for given topics over time.
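
The abstract does not say how the trend line is normalized. One plausible reading, offered here only as an illustrative sketch, is that each day's count of matching entries is divided by the total number of entries indexed that day, so that growth in the index itself does not look like buzz. The function name `normalized_trend`, the substring matching, and the sample entries are all assumptions made for this example; none of them come from the paper.

```python
from collections import Counter
from datetime import date


def normalized_trend(entries, query):
    """Return {day: share of that day's entries that mention the query}.

    `entries` is an iterable of (day, text) pairs. Matching is a plain
    case-insensitive substring test, an assumption made for this sketch.
    """
    totals = Counter()  # all entries seen per day
    hits = Counter()    # entries per day that mention the query
    needle = query.lower()
    for day, text in entries:
        totals[day] += 1
        if needle in text.lower():
            hits[day] += 1
    # Dividing by the daily total is the normalization step.
    return {day: hits[day] / totals[day] for day in sorted(totals)}


# Hypothetical sample data: the raw count doubles on the second day,
# but so does the number of entries, so the normalized share is unchanged.
entries = [
    (date(2004, 1, 1), "New phone rumors everywhere"),
    (date(2004, 1, 1), "Cooking notes"),
    (date(2004, 1, 2), "Phone launch today"),
    (date(2004, 1, 2), "More on the phone"),
    (date(2004, 1, 2), "Travel diary"),
    (date(2004, 1, 2), "Garden update"),
]
trend = normalized_trend(entries, "phone")
```

Plotting the values of `trend` against their dates would give a line of relative rather than absolute mentions, which is the property the abstract attributes to trend search.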