Authors: Pablo E. Román, Robert F. Dell, Juan D. Velásquez
DOI: 10.1007/978-3-642-14461-5_2
Keywords:
Abstract: Central to successful e-business is the construction of web sites that attract users, capture user preferences, and entice them into making a purchase. Web mining is diverse data mining applied to categorize both the content and structure of web sites with the goal of aiding e-business. Web mining requires knowledge of the web site structure (hyperlink graph), the web content (vector model), and user sessions (the sequence of pages visited by each user to a site). Much of the data for web mining can be noisy. The noise comes from many sources: for example, undocumented changes to the web site structure and content, different understanding of the text and media semantics, and web logs without individual user identification. There may not be any record of the number of times a specific page has been visited in a session, as the page is stored on a web proxy or browser cache. Such noise presents a challenge for web mining. This chapter presents issues with and approaches for cleaning web data in preparation for web mining analysis.