Authors: Jacob Abernethy, Olivier Chapelle, Carlos Castillo
Keywords:
Abstract: We present an algorithm, WITCH, that learns to detect spam hosts or pages on the Web. Unlike most other approaches, it simultaneously exploits the structure of the Web graph as well as page contents and features. The method is efficient and scalable, and it provides state-of-the-art accuracy on a standard benchmark.