A Method of Predicting News Update Time Combining Exponential Smoothing and Naive Bayes

作者: Mengmeng Wang , Xianglin Zuo , Ying Wang , Wanli Zuo

DOI: 10.1109/ISISE.2012.57

关键词:

摘要: The time of web page update appears to be erratic, how the user can fast access valuable information has become one hot spots. From view application, we use mathematical models forecast news reports, although it not completely accurate. In this paper, proposed a combined predict algorithm for update. Firstly, applied Exponential Smoothing method our dataset. Secondly, leveraged Naive Bayes Model prediction. Finally, two methods Combination Forecasting. Through experiments, show that Forecasting outperforms other while estimating localized rate updates.

参考文章(7)
Gautam Pant, Filippo Menczer, Topical Crawling for Business Intelligence international conference theory and practice digital libraries. pp. 233- 244 ,(2003) , 10.1007/978-3-540-45175-4_22
Vangelis Karkaletsis, Shipra Dingare, Georgios Paliouras, Claire Grover, James Curran, Konstantinos Stamatakis, James Horlock, Domain-specific Web site identification: the CROSSMARC focused Web crawler pp. 75- 78 ,(2003)
Thompson S.H Teo, Wing Yee Choo, Assessing the impact of using the Internet for competitive intelligence Information & Management. ,vol. 39, pp. 67- 83 ,(2001) , 10.1016/S0378-7206(01)00080-5
Dennis Fetterly, Mark Manasse, Marc Najork, Janet L. Wiener, A large-scale study of the evolution of web pages Software - Practice and Experience. ,vol. 34, pp. 213- 237 ,(2004) , 10.1002/SPE.577
Brian E. Brewington, George Cybenko, How dynamic is the Web the web conference. ,vol. 33, pp. 257- 276 ,(2000) , 10.1016/S1389-1286(00)00045-1
Filippo Menczer, Richard K. Belew, Adaptive Retrieval Agents: Internalizing Local Contextand Scaling up to the Web Machine Learning. ,vol. 39, pp. 203- 242 ,(2000) , 10.1023/A:1007653114902
Filippo Menczer, Gautam Pant, Padmini Srinivasan, Topical web crawlers: Evaluating adaptive algorithms ACM Transactions on Internet Technology. ,vol. 4, pp. 378- 419 ,(2004) , 10.1145/1031114.1031117