Feature Engineering for Deep Reinforcement Learning Based Routing

作者: Jose Suarez-Varela , Albert Mestres , Junlin Yu , Li Kuang , Haoyu Feng

DOI: 10.1109/ICC.2019.8761276

关键词: Representation (mathematics)Routing (electronic design automation)Quality of serviceFeature engineeringDistributed computingNetwork topologyAction (philosophy)Reinforcement learningState (computer science)Computer science

摘要: Recent advances in Deep Reinforcement Learning (DRL) techniques are providing a dramatic improvement decision-making and automated control problems. As result, we witnessing growing number of research works that proposing ways applying DRL to network-related problems such as routing. However, proposals failed achieve good results, often under-performing traditional routing techniques. We argue successfully DRL-based networking requires finding representations the network parameters: feature engineering. agents need represent both state (e.g., link utilization) action space changes policy). In this paper, show existing approaches use straightforward lead poor performance. propose novel representation outperforms ones is flexible enough be applied many use-cases. test our two different scenarios: (i) optical transport networks (ii) QoS-aware IP networks. Our results agent achieves significantly better performance compared state/action representations.

参考文章(12)
John Schulman, None, Trust Region Policy Optimization international conference on machine learning. pp. 1889- 1897 ,(2015)
Hamzeh Beyranvand, Jawad A. Salehi, A Quality-of-Transmission Aware Dynamic Routing and Spectrum Assignment Scheme for Future Elastic Optical Networks Journal of Lightwave Technology. ,vol. 31, pp. 3043- 3054 ,(2013) , 10.1109/JLT.2013.2278572
A. Medina, N. Taft, K. Salamatian, S. Bhattacharyya, C. Diot, Traffic matrix estimation Proceedings of the 2002 conference on Applications, technologies, architectures, and protocols for computer communications - SIGCOMM '02. ,vol. 32, pp. 161- 174 ,(2002) , 10.1145/633025.633041
Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A Rusu, Joel Veness, Marc G Bellemare, Alex Graves, Martin Riedmiller, Andreas K Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg, Demis Hassabis, None, Human-level control through deep reinforcement learning Nature. ,vol. 518, pp. 529- 533 ,(2015) , 10.1038/NATURE14236
Joao Pedro, Joao Santos, Joao Pires, Performance evaluation of integrated OTN/DWDM networks with single-stage multiplexing of optical channel data units international conference on transparent optical networks. pp. 1- 4 ,(2011) , 10.1109/ICTON.2011.5970940
Shih-Chun Lin, Ian F. Akyildiz, Pu Wang, Min Luo, QoS-Aware Adaptive Routing in Multi-layer Hierarchical Software Defined Networks: A Reinforcement Learning Approach 2016 IEEE International Conference on Services Computing (SCC). pp. 25- 33 ,(2016) , 10.1109/SCC.2016.12
Liang Guo, I. Matta, The war between mice and elephants international conference on network protocols. ,vol. 2001, pp. 180- 188 ,(2001) , 10.1109/ICNP.2001.992898
Bomin Mao, Zubair Md. Fadlullah, Fengxiao Tang, Nei Kato, Osamu Akashi, Takeru Inoue, Kimihiro Mizutani, Routing or Computing? The Paradigm Shift Towards Intelligent Computer Network Packet Transmission Based on Deep Learning IEEE Transactions on Computers. ,vol. 66, pp. 1946- 1960 ,(2017) , 10.1109/TC.2017.2709742
David Sanchez-Charles, Victor Muntes-Mulero, Albert Cabellos, Marta Arias, Giorgio Stampa, A Deep-Reinforcement Learning Approach for Software-Defined Networking Routing Optimization arXiv: Networking and Internet Architecture. ,(2017)
Asaf Valadarsky, Michael Schapira, Dafna Shahaf, Aviv Tamar, Learning to Route hot topics in networks. pp. 185- 191 ,(2017) , 10.1145/3152434.3152441