Authors: Robert Holte, John Hawkin, Duane Szafron
DOI:
Keywords: Perfect information, Decision problem, Limit (mathematics), Mathematical optimization, Action (philosophy), Extensive-form game, Focus (computing), Space (commercial competition), Computer science, Value (ethics), Artificial intelligence
Abstract: Multi-agent decision problems can often be formulated as extensive-form games. We focus on imperfect information extensive-form games in which one or more actions at many decision points have an associated continuous or many-valued parameter. A stock trading agent, in addition to deciding whether to buy or not, must decide how much to buy. In no-limit poker, in addition to selecting a probability for each action, the agent must decide how much to bet for each betting action. Selecting values for these parameters makes these games extremely large. Two-player no-limit Texas Hold'em poker with stacks of 500 big blinds has approximately 10^71 states, which is more than 10^50 times more states than two-player limit Texas Hold'em. The main contribution of this paper is a technique that abstracts a game's action space by selecting one, or a small number, of the many values for each parameter. We show that strategies computed using this new algorithm for no-limit Leduc poker exhibit significant utility gains over ε-Nash equilibrium strategies computed with standard, hand-crafted parameter value abstractions.
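To make the idea of a parameter value abstraction concrete, the following minimal sketch shows the kind of hand-crafted baseline the abstract refers to, not the paper's algorithm. It assumes integer chip amounts and keeps only a few bet sizes expressed as fractions of the pot, plus all-in. The function name `abstract_bet_sizes` and the default pot fractions are illustrative assumptions, not taken from the paper.

```python
def abstract_bet_sizes(pot, stack, pot_fractions=(0.5, 1.0)):
    """Return the small set of bet sizes kept by a hand-crafted abstraction.

    pot           -- chips currently in the pot (assumed integer)
    stack         -- chips the acting player has left (assumed integer)
    pot_fractions -- hypothetical hand-picked fractions of the pot
    """
    sizes = set()
    for fraction in pot_fractions:
        bet = round(fraction * pot)
        # Keep only bets that are legal and strictly smaller than all-in.
        if 0 < bet < stack:
            sizes.add(bet)
    sizes.add(stack)  # all-in is always kept as an option
    return sorted(sizes)


def full_bet_sizes(stack):
    """Every integer bet from 1 chip up to the whole stack (unabstracted)."""
    return list(range(1, stack + 1))


if __name__ == "__main__":
    pot, stack = 10, 500
    print(len(full_bet_sizes(stack)), "bet sizes before abstraction")
    print(abstract_bet_sizes(pot, stack), "bet sizes after abstraction")
```

In this sketch a decision point with 500 possible bet sizes collapses to three. The technique described in the abstract instead selects the retained values automatically rather than fixing them by hand.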