作者: Xiao Wei , Xiangfeng Luo , Qing Li
DOI: 10.4018/IJCINI.2013040104
关键词: Information retrieval 、 World Wide Web 、 Web service 、 Search engine 、 Service (systems architecture) 、 Semantic compression 、 Duplicate content 、 Compression ratio 、 Computer science 、 Reading (process) 、 Web page
摘要: Both compression and decompression play important roles in a web service system. High ratio helps to save the storage, while fast contributes decreasing response time of service. Specifically focusing on news service, this paper proposes mechanism improve efficiency simultaneously by taking advantage semantic relations among webpages. Firstly, webpages are clustered into topics according similarity relation Webpages belonging same topic have much duplicate content, which can when using delta-compression. Secondly, associated detected with help multiple-semantics link network topics. Associated compressed zip file may decrease times habit user's reading Web. The authors apply proposed practical search engine experimental results show that it has high speed as well.