First Steps Towards an Intelligent Laser Welding Architecture Using Deep Neural Networks and Reinforcement Learning

作者: Johannes Günther , Patrick M. Pilarski , Gerhard Helfrich , Hao Shen , Klaus Diepold

DOI: 10.1016/J.PROTCY.2014.09.007

关键词:

摘要: Abstract To address control difficulties in laser welding, we propose the idea of a self-learning and self-improving welding system that combines three modern machine learning techniques. We first show ability deep neural network to extract meaningful, low-dimensional features from high-dimensional laser-welding camera data. These are then used by temporal-difference algorithm predict anticipate important aspects system's sensor The third part our proposed architecture suggests using these predictions learn deliver situation-appropriate power; preliminary results demonstrated simulator. intelligent introduced this work has capacity improve its performance without further human assistance therefore addresses key requirements industry. knowledge, it is combination Nexting with general value functions also usage for specifically production engineering general. This provides unique example how can be explicitly learned reinforcement support welding. believe would straightforward adapt approach other applications.

参考文章(29)
Paul Herbert Zipkin, The Limits of Mass Customization MIT Sloan Management Review. ,vol. 42, pp. 81- 87 ,(2001)
Antonie van den Bogert, Michael Branicky, Philip Thomas, Kathleen Jagodnik, Application of the Actor-Critic Architecture to Functional Electrical Stimulation Control of a Human Arm. innovative applications of artificial intelligence. ,vol. 2009, pp. 165- 172 ,(2009)
James H Gilmore, B Joseph Pine, The four faces of mass customization. Harvard Business Review. ,vol. 75, pp. 91- 101 ,(1997)
Feng Liu, Chai Quek, Geok See Ng, Neural network model for time series prediction by reinforcement learning international joint conference on neural network. ,vol. 2, pp. 809- 814 ,(2005) , 10.1109/IJCNN.2005.1555956
H. Kimura, T. Yamashita, S. Kobayashi, Reinforcement learning of walking behavior for a four-legged robot conference on decision and control. ,vol. 1, pp. 411- 416 ,(2001) , 10.1109/CDC.2001.980135
Volodymyr Mnih, Ioannis Antonoglou, Koray Kavukcuoglu, Daan Wierstra, Martin A. Riedmiller, Alex Graves, David Silver, Playing Atari with Deep Reinforcement Learning arXiv: Learning. ,(2013)
Yoshua Bengio, Practical recommendations for gradient-based training of deep architectures Neural Networks: Tricks of the Trade (2nd ed.). pp. 437- 478 ,(2012) , 10.1007/978-3-642-35289-8_26
Antonio Ancona, Vincenzo Spagnolo, Pietro Mario Lugarà, Michele Ferrara, Optical sensor for real-time monitoring of CO 2 laser welding process Applied Optics. ,vol. 40, pp. 6019- 6025 ,(2001) , 10.1364/AO.40.006019
Sascha Lange, Martin Riedmiller, Arne Voigtlander, Autonomous reinforcement learning on raw visual input data in a real world application international joint conference on neural network. pp. 1- 8 ,(2012) , 10.1109/IJCNN.2012.6252823
Hu Min-ying, Shi-Junwei, Cai-jinjin Li-xin, Xing-yanqiu, The research of welding parameters on weld shape in the laser deep penetration welding international conference on mechanic automation and control engineering. pp. 3734- 3737 ,(2010) , 10.1109/MACE.2010.5535453