Comprehensive Review of Deep Reinforcement Learning Methods and Applications in Economics

作者: Amir Mosavi , Pedram Ghamisi , Yaser Faghan , Puhong Duan , Shahab Shamshirband

DOI: 10.20944/PREPRINTS202003.0309.V1

关键词:

摘要: The popularity of deep reinforcement learning (DRL) methods in economics have been exponentially increased. DRL through a wide range capabilities from (RL) and (DL) for handling sophisticated dynamic business environments offers vast opportunities. is characterized by scalability with the potential to be applied high-dimensional problems conjunction noisy nonlinear patterns economic data. In this work, we first consider brief review DL, RL, RL diverse applications providing an in-depth insight into state art. Furthermore, architecture investigated order highlight complexity, robustness, accuracy, performance, computational tasks, risk constraints, profitability. survey results indicate that can provide better performance higher accuracy as compared traditional algorithms while facing real at presence parameters ever-increasing uncertainties.

参考文章(101)
Adnan Haider, Muhammad Nadeem Hanif, INFLATION FORECASTING IN PAKISTAN USING ARTIFICIAL NEURAL NETWORKS Research Papers in Economics. ,(2007)
Roland Hafner, Martin Riedmiller, Reinforcement learning in feedback control Machine Learning. ,vol. 84, pp. 137- 169 ,(2011) , 10.1007/S10994-011-5235-X
Çaglar Gülçehre, Yoshua Bengio, Yoshua Bengio, Yoshua Bengio, KyungHyun Cho, Junyoung Chung, Empirical evaluation of gated recurrent neural networks on sequence modeling arXiv: Neural and Evolutionary Computing. ,(2014)
Thomas B. Schön, Niklas Wahlström, Marc Peter Deisenroth, From Pixels to Torques: Policy Learning with Deep Dynamical Models arXiv: Machine Learning. ,(2015)
John Moody, Lizhong Wu, Yuansong Liao, Matthew Saffell, Performance functions and reinforcement learning for trading systems and portfolios Journal of Forecasting. ,vol. 17, pp. 441- 470 ,(1998) , 10.1002/(SICI)1099-131X(1998090)17:5/6<441::AID-FOR707>3.0.CO;2-#
Stelios D. Bekiros, Heterogeneous trading strategies with adaptive fuzzy Actor–Critic reinforcement learning: A behavioral approach Journal of Economic Dynamics and Control. ,vol. 34, pp. 1153- 1170 ,(2010) , 10.1016/J.JEDC.2010.01.015
Yi Peng, Gang Kou, Yong Shi, Zhengxin Chen, A Multi-criteria Convex Quadratic Programming model for credit data analysis decision support systems. ,vol. 44, pp. 1016- 1030 ,(2008) , 10.1016/J.DSS.2007.12.001
Alejandro M. Manelli, Daniel R. Vincent, Bundling as an optimal selling mechanism for a multiple-good monopolist Journal of Economic Theory. ,vol. 127, pp. 1- 35 ,(2006) , 10.1016/J.JET.2005.08.007
Ronald J. Williams, David Zipser, A learning algorithm for continually running fully recurrent neural networks Neural Computation. ,vol. 1, pp. 270- 280 ,(1989) , 10.1162/NECO.1989.1.2.270
Jigar Patel, Sahil Shah, Priyank Thakkar, K Kotecha, Predicting stock and stock price index movement using Trend Deterministic Data Preparation and machine learning techniques Expert Systems With Applications. ,vol. 42, pp. 259- 268 ,(2015) , 10.1016/J.ESWA.2014.07.040