SMILER: Towards Practical Online Traffic Classification

Authors: Baohua Yang, Guangdong Hou, Lingyun Ruan, Yibo Xue, Jun Li

DOI: 10.1109/ANCS.2011.34

Keywords:

Abstract: Network traffic classification is extremely important in numerous network functions today. However, most current approaches, which are based on port numbers or payload detection, are becoming increasingly impractical with the appearance of dynamic and encrypted applications. Although some supervised learning approaches have been proposed, it is difficult to collect sufficient flow-labeled traces for training. On the other hand, online classification needs early identification, which is still challenging for well-known approaches. In this paper, we propose a semi-supervised approach named SMILER, which supports early identification from the sizes of the first few packets (empirically, 5 packets) of a flow. Experiments on real networks demonstrate that SMILER achieves 94% precision and 96% recall on average across all tested applications, and that it works well even with disordered packets. With a hybrid scheme, the performance is further improved. Meanwhile, SMILER is fast, including in updating. All experimental results show that SMILER is practical for accurate online traffic classification.
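The abstract names the feature (sizes of the first few packets of a flow) and the learning setting (semi-supervised) but does not give the algorithm. The following is a minimal sketch of how such a classifier could look, assuming a seeded k-means style procedure: class centroids are seeded from a few labeled flows and then refined with unlabeled flows. This is a stand-in technique chosen for illustration, not the method from the paper. The function names (`size_vector`, `fit_seeded_centroids`, `classify`), the zero-padding of short flows, and all packet sizes and application labels are hypothetical.

```python
from math import dist

N_PACKETS = 5  # the abstract reports 5 packets as the empirical choice


def size_vector(packet_sizes, n=N_PACKETS):
    """Feature vector: sizes of the first n packets, zero-padded for short flows."""
    head = list(packet_sizes[:n])
    return tuple(head + [0] * (n - len(head)))


def mean(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    return tuple(sum(column) / len(vectors) for column in zip(*vectors))


def classify(vec, centroids):
    """Return the application whose centroid is nearest (Euclidean distance)."""
    return min(centroids, key=lambda app: dist(vec, centroids[app]))


def fit_seeded_centroids(labeled, unlabeled, rounds=10):
    """Seeded k-means sketch (an assumption, not the paper's algorithm).

    labeled:   list of (vector, app) pairs; these keep their label throughout.
    unlabeled: list of vectors; these are reassigned to the nearest centroid
               in every round and then contribute to the centroid update.
    """
    by_app = {}
    for vec, app in labeled:
        by_app.setdefault(app, []).append(vec)
    centroids = {app: mean(vecs) for app, vecs in by_app.items()}

    for _ in range(rounds):
        members = {app: list(vecs) for app, vecs in by_app.items()}
        for vec in unlabeled:
            members[classify(vec, centroids)].append(vec)
        centroids = {app: mean(vecs) for app, vecs in members.items()}
    return centroids


# Hypothetical training data: two labeled flows and two unlabeled flows.
labeled = [
    (size_vector([74, 74, 66, 583, 66]), "web"),
    (size_vector([90, 120, 1400, 1400, 1400]), "p2p"),
]
unlabeled = [
    size_vector([74, 70, 66, 600, 66]),
    size_vector([95, 130, 1380, 1400]),  # short flow, padded to 5 entries
]
centroids = fit_seeded_centroids(labeled, unlabeled)

# Only the first 5 packet sizes of a new flow are used, so a decision
# is available before the flow has finished.
prediction = classify(size_vector([74, 74, 66, 590, 66, 1500, 1500]), centroids)
```

Because the feature vector depends only on the first few packets, a flow can be classified early, which is the property the abstract emphasizes for online use. Updating such a model would amount to rerunning the centroid refinement with newly observed flows.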
