Q-Learning: From Computer Network Security to Software Security

作者: Arisoa S. Randrianasolo , Larry D. Pyeatt

DOI: 10.1109/ICMLA.2014.47

关键词:

摘要: Reinforcement learning techniques become more popular in computer network security. The same reinforcement developed for security can be applied to software as well. This research summarizes a work progress attempt incorporate Q-learning algorithm method is embedded part of the itself provide mechanism that has ability learn by develop temporary repair mechanism. results experiment express given right parameters and setting approach rapidly learns block all malicious actions. Data analysis on Q-values produced diagnostic A larger scale expected seen future work.

参考文章(14)
Christopher J. C. H. Watkins, Peter Dayan, Technical Note : \cal Q -Learning Machine Learning. ,vol. 8, pp. 279- 292 ,(1992) , 10.1007/BF00992698
Richard N. Taylor, Jie Ren, A Secure Software Architecture Description Language ,(2005)
Thinh Nguyen, Alan Fern, Scott Dejmal, Reinforcement learning for vulnerability assessment in peer-to-peer networks innovative applications of artificial intelligence. pp. 1655- 1662 ,(2008)
Christopher J.C.H. Watkins, Peter Dayan, Technical Note Q-Learning Machine Learning. ,vol. 8, pp. 279- 292 ,(1992) , 10.1023/A:1022676722315
Kai Chen, Yifeng Lian, Yingjun Zhang, Automatically generating patch in binary programs using attribute-based taint analysis international conference on information and communication security. pp. 367- 382 ,(2010) , 10.1007/978-3-642-17650-0_26
Xuxian Jiang, Li Xie, Bing Mao, Dongyan Xu, Zhiqiang Lin, AutoPaG: towards automated software patch generation with source code root cause identification and repair computer and communications security. pp. 329- 340 ,(2007) , 10.1145/1229285.1267001
G.Prem Kumar, P. Venkataram, Review: Artificial intelligence approaches to network management: recent advances and a survey Computer Communications. ,vol. 20, pp. 1313- 1322 ,(1997) , 10.1016/S0140-3664(97)00094-7
Jeff H. Perkins, Greg Sullivan, Weng-Fai Wong, Yoav Zibin, Michael D. Ernst, Martin Rinard, Sunghun Kim, Sam Larsen, Saman Amarasinghe, Jonathan Bachrach, Michael Carbin, Carlos Pacheco, Frank Sherwood, Stelios Sidiroglou, Automatically patching errors in deployed software symposium on operating systems principles. pp. 87- 102 ,(2009) , 10.1145/1629575.1629585
A.G. Barto, R.S. Sutton, Reinforcement Learning: An Introduction ,(1988)