作者: Wilfried N. Gansterer , David Pölz
DOI: 10.1007/978-3-642-00958-7_40
关键词: Data mining 、 Phishing 、 Basis (linear algebra) 、 Contrast (statistics) 、 One-class classification 、 Rank (computer programming) 、 Binary classification 、 Computer science 、 Sequence 、 Support vector machine
摘要: We discuss a classification-based approach for filtering phishing messages in an e-mail stream. Upon arrival, various features of every are extracted. This forms the basis classification process which detects potentially harmful messages. introduce new identifying and rank established as well newly introduced according to their significance this problem. Moreover, contrast classical binary approaches (spam vs. not spam), more refined ternary data is investigated automatically distinguishes three message types: ham (solicited e-mail), spam, phishing. Experiments with representative sets illustrate that our yields better results than existing detection methods. direct proposed compared sequence two processes. Direct one-step only efficient, but also shown achieve accuracy repeated classification.