An algorithmic approach to identify irrelevant information in sequential teams

Authors: Aditya Mahajan, Sekhar Tatikonda

DOI: 10.1016/J.AUTOMATICA.2015.08.002

Keywords: Machine learning, Graph (abstract data type), Bayesian network, Computer science, Markov decision theory, System dynamics, Directed acyclic graph, Artificial intelligence, Graphical model

Abstract: An algorithmic framework for identifying irrelevant data (i.e., data that may be ignored without any loss of optimality) at agents of a sequential team is presented. This framework relies on capturing the properties of a sequential team that do not depend on the specifics of the state spaces, the probability law, the system dynamics, or the cost functions. To capture these properties, the notion of a team form is developed. A team form is then modeled as a directed acyclic graph, and irrelevant data is identified using D-separation properties of specific subsets of nodes in the graph. This framework provides an algorithmic procedure for identifying and ignoring irrelevant data at agents, thereby simplifying the form of the control laws that need to be implemented.
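
To illustrate the D-separation test the abstract refers to, here is a minimal Python sketch of a generic D-separation check on a directed acyclic graph, using the standard moralized ancestral graph construction. It is not the paper's procedure: the toy graph, the node names (s1, u1, s2, u2, r), and the function names are hypothetical, and the abstract does not state which conditioning sets the paper's algorithm uses.

```python
from collections import deque


def ancestors(parents, nodes):
    """Return the given nodes together with all of their ancestors."""
    seen = set(nodes)
    stack = list(nodes)
    while stack:
        node = stack.pop()
        for p in parents.get(node, ()):
            if p not in seen:
                seen.add(p)
                stack.append(p)
    return seen


def d_separated(parents, xs, ys, zs):
    """True if xs and ys are D-separated given zs in the DAG `parents`.

    `parents` maps each node to an iterable of its parent nodes.
    """
    xs, ys, zs = set(xs), set(ys), set(zs)
    keep = ancestors(parents, xs | ys | zs)

    # Moralize the ancestral subgraph: link each child to its parents,
    # link co-parents to each other, and drop edge directions.
    adj = {n: set() for n in keep}
    for child in keep:
        ps = [p for p in parents.get(child, ()) if p in keep]
        for i, p in enumerate(ps):
            adj[child].add(p)
            adj[p].add(child)
            for q in ps[i + 1:]:
                adj[p].add(q)
                adj[q].add(p)

    # Search from xs for any node in ys without passing through zs.
    frontier = deque(xs - zs)
    seen = set(frontier)
    while frontier:
        node = frontier.popleft()
        if node in ys:
            return False
        for nxt in adj[node]:
            if nxt not in seen and nxt not in zs:
                seen.add(nxt)
                frontier.append(nxt)
    return True


# Hypothetical two-stage example: states s1, s2; controls u1, u2; reward r.
# The agent choosing u2 observes both s1 and s2.
toy = {
    "u1": ["s1"],
    "s2": ["s1", "u1"],
    "u2": ["s1", "s2"],
    "r": ["s2", "u2"],
}

old_state_blocked = d_separated(toy, {"s1"}, {"r"}, {"s2", "u2"})
old_state_needed = not d_separated(toy, {"s1"}, {"r"}, {"u2"})
print(old_state_blocked, old_state_needed)
```

In this toy graph, s1 is D-separated from the reward r once s2 and u2 are given, which is the kind of graphical condition under which the second agent's observation of s1 could be a candidate for removal. Without s2 in the conditioning set the separation fails.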
