Learning to bid in bridge

Authors: Asaf Amit, Shaul Markovitch

DOI: 10.1007/S10994-006-6225-2

Abstract: Bridge bidding is considered to be one of the most difficult problems for game-playing programs. It involves four agents rather than two, including a cooperative agent. In addition, the partial observability of the game makes it impossible to predict the outcome of each action. In this paper we present a new decision-making algorithm that is capable of overcoming these problems. The algorithm allows models to be used for both opponent agents and partners, while utilizing a novel model-based Monte Carlo sampling method to overcome the problem of hidden information. The paper also presents a learning framework that uses the above decision-making algorithm for co-training of partners. The agents refine their selection strategies during training and continuously exchange their refined strategies. The refinement is based on inductive learning applied to examples accumulated for classes of states with conflicting actions. The algorithm was empirically evaluated on a set of bridge deals. The pair of agents that co-trained significantly improved their bidding performance to a level surpassing that of the current state-of-the-art bidding algorithm.
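The idea of model-based Monte Carlo sampling mentioned in the abstract can be illustrated with a small Python sketch. This is not the authors' algorithm: it only shows the general pattern of dealing the unseen cards at random, keeping the deals that agree with a model of what partner's bidding implies, and averaging an evaluation over the kept deals. The names `sample_consistent_deals`, `choose_action`, `partner_model`, and `evaluate` are hypothetical, and the high-card-point test stands in for whatever the real agent models would express.

```python
import random

RANKS = "23456789TJQKA"
SUITS = "SHDC"
DECK = [rank + suit for suit in SUITS for rank in RANKS]
HCP = {"A": 4, "K": 3, "Q": 2, "J": 1}


def high_card_points(hand):
    """Standard 4-3-2-1 high-card point count of a hand."""
    return sum(HCP.get(card[0], 0) for card in hand)


def sample_consistent_deals(my_hand, partner_model, n_samples, rng,
                            max_tries=100_000):
    """Rejection-sample deals of the 39 unseen cards.

    A deal is kept only if partner's hand satisfies `partner_model`,
    a hypothetical predicate encoding what partner's bids imply.
    """
    seen = set(my_hand)
    unseen = [card for card in DECK if card not in seen]
    deals = []
    for _ in range(max_tries):
        if len(deals) == n_samples:
            break
        rng.shuffle(unseen)
        partner, left, right = unseen[:13], unseen[13:26], unseen[26:]
        if partner_model(partner):
            deals.append((partner, left, right))  # slices are fresh lists
    return deals


def choose_action(actions, deals, evaluate):
    """Pick the action with the best average score over sampled deals.

    `evaluate(action, deal)` is an assumed scoring function supplied
    by the caller; it is not taken from the paper.
    """
    if not deals:
        raise ValueError("no consistent deals were sampled")

    def expected(action):
        return sum(evaluate(action, deal) for deal in deals) / len(deals)

    return max(actions, key=expected)
```

A caller might pass `partner_model=lambda hand: high_card_points(hand) >= 12` to model an opening bid by partner, then score candidate calls such as `"pass"` and `"4S"` over the sampled deals. Rejection sampling is used here only because it is simple; it becomes slow when the model constraints are tight.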
