pForest: In-Network Inference with Random Forests.

作者: Laurent Vanbever , Roland Meier , Alexander Dietmüller , Tobias Bühler , Coralie Busse-Grawitz

DOI:

关键词:

摘要: The concept of "self-driving networks" has recently emerged as a possible solution to manage the ever-growing complexity modern network infrastructures. In self-driving network, devices adapt their decisions in real-time by observing traffic and performing in-line inference according machine learning models. recent advent programmable data planes gives us unique opportunity implement this vision. One open question though is whether these are powerful enough run such complex tasks? We answer positively presenting pForest, system for in-network supervised models on top planes. key challenge design classification that fit constraints (e.g., no floating points, loops, limited memory) while providing high accuracy. pForest addresses three phases: (i) it optimizes features selection capabilities devices; (ii) trains random forest tailored different phases flow; (iii) applies real time, per-packet basis. fully implemented Python (training), P4_16 (inference). Our evaluation shows can classify at line rate hundreds thousands flows, with an accuracy on-par software-based solutions. We further show practicality deploying existing hardware (Barefoot Tofino).

参考文章(31)
Tristan Groléat, Sandrine Vaton, Matthieu Arzel, High-speed flow-based classification on FPGA International Journal of Network Management. ,vol. 24, pp. 253- 271 ,(2014) , 10.1002/NEM.1863
Hans-Peter Kriegel, Martin Ester, Jörg Sander, Xiaowei Xu, A density-based algorithm for discovering clusters in large spatial Databases with Noise knowledge discovery and data mining. pp. 226- 231 ,(1996)
Tin Kam Ho, Random decision forests international conference on document analysis and recognition. ,vol. 1, pp. 278- 282 ,(1995) , 10.1109/ICDAR.1995.598994
Lizhi Peng, Bo Yang, Yuehui Chen, Zhenxiang Chen, Effectiveness of Statistical Features for Early Stage Internet Traffic Identification International Journal of Parallel Programming. ,vol. 44, pp. 181- 197 ,(2016) , 10.1007/S10766-014-0337-2
Akers, Binary Decision Diagrams IEEE Transactions on Computers. ,vol. 27, pp. 509- 516 ,(1978) , 10.1109/TC.1978.1675141
Pat Bosshart, Dan Daly, Glen Gibb, Martin Izzard, Nick McKeown, Jennifer Rexford, Cole Schlesinger, Dan Talayco, Amin Vahdat, George Varghese, David Walker, P4: programming protocol-independent packet processors acm special interest group on data communication. ,vol. 44, pp. 87- 95 ,(2014) , 10.1145/2656877.2656890
F. Gringoli, Luca Salgarelli, M. Dusi, N. Cascarano, F. Risso, k. c. claffy, GT: picking up the truth from the ground for internet traffic acm special interest group on data communication. ,vol. 39, pp. 12- 18 ,(2009) , 10.1145/1629607.1629610
Bela Hullar, Sandor Laki, Andras Gyorgy, Early Identification of Peer-to-Peer Traffic 2011 IEEE International Conference on Communications (ICC). pp. 1- 6 ,(2011) , 10.1109/ICC.2011.5963023
Samoda Gamage, Ajith Pasqual, High performance parallel packet Classification architecture with Popular Rule Caching international conference on networks. pp. 52- 57 ,(2012) , 10.1109/ICON.2012.6506533
Yaxuan Qi, Jeffrey Fong, Weirong Jiang, Bo Xu, Jun Li, Viktor Prasanna, Multi-dimensional packet classification on FPGA: 100 Gbps and beyond field-programmable technology. pp. 241- 248 ,(2010) , 10.1109/FPT.2010.5681492