作者: Gang Lu , Shumei Liu , Kevin Lü
DOI: 10.1007/978-3-642-34531-9_13
关键词:
摘要: Getting data is the precondition of researching on micro-blogging services. By using Web 2.0 techniques such as AJAX, contents micro-blog pages are dynamically generated rapidly. That makes it hard for traditional page crawler to crawl pages. Micro-blogging services provide some APIs. Through APIs, well-structured can be easily obtained. A software architecture service crawler, which named MBCrawler, designed basing APIs provided by The modular and scalable, so fit specific features different SinaMBCrawler, a application based MBCrawler Sina Weibo, has been developed. It automatically invokes Weibo data. crawled saved into local database.