NTCS: A real time flow-based network traffic classification system

作者: Silas Santiago Lopes Pereira , Jorge Luiz De Castro e Silva , Jose Everardo Bessa Maia

DOI: 10.1109/CNSM.2014.7014196

关键词: Data miningBottleneckPreprocessorPacket analyzerDecision treeComputer scienceNaive Bayes classifierEnsemble learningTraffic classificationAdaBoost

摘要: This work presents the design and implementation of a real time flow-based network traffic classification system. The classifier monitor acts as pipeline consisting three modules: packet capture preprocessing, flow reassembly, with Machine Learning (ML). modules are built concurrent processes well defined data interfaces between them so that any module can be improved updated independently. In this pipeline, reassembly function becomes bottleneck performance. implementation, was used efficient method which results in average delivery delay 0.49 seconds, aproximately. For module, performances K-Nearest Neighbor (KNN), C4.5 Decision Tree, Naive Bayes (NB), Flexible (FNB) AdaBoost Ensemble Algorithm compared order to validate our approach.

参考文章(30)
Benoit Claise, Cisco Systems NetFlow Services Export Version 9 RFC. ,vol. 3954, pp. 1- 33 ,(2004)
Denis Zuev, Andrew W. Moore, Traffic Classification Using a Statistical Approach Lecture Notes in Computer Science. pp. 321- 324 ,(2005) , 10.1007/978-3-540-31966-5_25
Roni Bar - Yanai, Michael Langberg, David Peleg, Liam Roditty, Realtime classification for encrypted traffic symposium on experimental and efficient algorithms. pp. 373- 385 ,(2010) , 10.1007/978-3-642-13193-6_32
Mark A. Hall, Ian H. Witten, Eibe Frank, Data Mining: Practical Machine Learning Tools and Techniques ,(1999)
George H. John, Pat Langley, Estimating continuous distributions in Bayesian classifiers uncertainty in artificial intelligence. pp. 338- 345 ,(1995)
Mohammed J. Islam , Q. M. Jonathan Wu , Majid Ahmadi , Maher A. SidAhmed , Investigating the Performance of Naive- Bayes Classifiers and K- Nearest Neighbor Classifiers Journal of Convergence Information Technology. ,vol. 5, pp. 133- 137 ,(2010) , 10.4156/JCIT.VOL5.ISSUE2.15
Peter Siska, Marc Ph. Stoecklin, Andreas Kind, Torsten Braun, A flow trace generator using graph-based traffic classification techniques Proceedings of the 6th International Wireless Communications and Mobile Computing Conference on ZZZ - IWCMC '10. pp. 457- 462 ,(2010) , 10.1145/1815396.1815503
Hyunchul Kim, KC Claffy, Marina Fomenkov, Dhiman Barman, Michalis Faloutsos, KiYoung Lee, Internet traffic classification demystified: myths, caveats, and the best practices conference on emerging network experiment and technology. pp. 11- ,(2008) , 10.1145/1544012.1544023
Andrew W. Moore, Denis Zuev, Internet traffic classification using bayesian analysis techniques measurement and modeling of computer systems. ,vol. 33, pp. 50- 60 ,(2005) , 10.1145/1064212.1064220