Abstract: This paper proposes an adaptation of Watkins' Q-learning (1989, 1992) to fuzzy inference systems, in which both the actions and the Q-functions are inferred from rules. The approach is compared with a genetic algorithm on the cart-centering problem, and the results show its effectiveness.
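To make concrete the idea of inferring both the action and the Q-value from rules, the following is a minimal sketch and not the authors' implementation. It assumes a one-dimensional state, evenly spaced triangular membership functions, a small discrete set of candidate actions per rule, and epsilon-greedy selection within each rule. The names `FuzzyQSketch` and `triangular` and all parameter values are hypothetical, chosen only for illustration.

```python
import random


def triangular(x, left, center, right):
    """Triangular membership function that peaks at `center` (assumed shape)."""
    if x <= left or x >= right:
        return 0.0
    if x <= center:
        return (x - left) / (center - left)
    return (right - x) / (right - center)


class FuzzyQSketch:
    """Illustrative fuzzy Q-learner: one rule per center, one q-value per (rule, action)."""

    def __init__(self, centers, actions, lr=0.1, gamma=0.9, epsilon=0.1, seed=0):
        self.centers = list(centers)  # assumed evenly spaced, at least two
        self.actions = list(actions)  # candidate actions shared by all rules
        self.lr, self.gamma, self.epsilon = lr, gamma, epsilon
        self.rng = random.Random(seed)
        self.q = [[0.0] * len(self.actions) for _ in self.centers]

    def strengths(self, x):
        """Normalized firing strength of each rule for state x."""
        x = min(max(x, self.centers[0]), self.centers[-1])  # clamp into covered range
        width = self.centers[1] - self.centers[0]
        raw = [triangular(x, c - width, c, c + width) for c in self.centers]
        total = sum(raw)
        return [w / total for w in raw]

    def act(self, x):
        """Each rule picks one candidate action; the global action is their weighted mean."""
        w = self.strengths(x)
        chosen = []
        for row in self.q:
            if self.rng.random() < self.epsilon:
                chosen.append(self.rng.randrange(len(row)))  # explore
            else:
                chosen.append(max(range(len(row)), key=row.__getitem__))  # exploit
        action = sum(wi * self.actions[j] for wi, j in zip(w, chosen))
        return action, chosen

    def q_value(self, x, chosen):
        """Q of the inferred action: weighted mean of the q-values of the chosen actions."""
        w = self.strengths(x)
        return sum(wi * row[j] for wi, row, j in zip(w, self.q, chosen))

    def state_value(self, x):
        """Value of a state: weighted mean of each rule's best q-value."""
        w = self.strengths(x)
        return sum(wi * max(row) for wi, row in zip(w, self.q))

    def update(self, x, chosen, reward, x_next):
        """Q-learning temporal-difference update, shared among rules by firing strength."""
        td = reward + self.gamma * self.state_value(x_next) - self.q_value(x, chosen)
        for i, wi in enumerate(self.strengths(x)):
            self.q[i][chosen[i]] += self.lr * td * wi
        return td
```

In use, one would call `act` on the current state, apply the returned action to the environment (for example a cart-centering simulator, which is not included here), and then call `update` with the observed reward and next state.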