Identification and Classification of Web Pages with Specified Domain

Authors: Poonam Nagale, Alka Vishwa

DOI: 10.5120/20793-3454

Keywords: Entertainment, Firewall (construction), Computer science, World Wide Web, Web page, The Internet, Cluster analysis

Abstract: The Internet is a very large source of information, but the flow of this information needs to be controlled in various organizations. For example, companies block job portals and personal mail services, and colleges block entertainment-related websites. Consider the college scenario: the administrator has to keep watch on the sites that students access, and uses a proxy or firewall to block sites that are not allowed. With the growth of the Internet, new sites are launched every day. It is not always feasible for the administrator to track them all, and doing so is somewhat time-consuming. Therefore, web links are clustered into five domains. The keywords must be preprocessed by an adaptive preprocessing technique to increase the performance of the system.
