Journal of Applied Sciences ›› 2000, Vol. 18 ›› Issue (4): 335-339.

• Articles • Previous Articles     Next Articles

Parameter Decision Making in Adaptive Markov Decision Process with Finite Planning Horizon

LI Jiang-hong, HAN Zheng-zhi   

  1. Institute of Intelligence Engineering, Shanghai Jiaotong University, Shanghai 200030, China
  • Received:1999-11-17 Revised:2000-01-17 Online:2000-12-31 Published:2000-12-31

Abstract: An algorithm is proposed for adaptive MDP with finite planning horizon by reason of the fact that all current algorithms only consider adaptive MDP with infinite planning horizon. Bayes principle is applied to learn an unknown system; and for every decision the probability that the actual decision equals the optimal decision is maximized. Simulation results demonstrate the validity of the new algorithm.

Key words: Markov decision process (MDP), adaptive decision making, Bayes principle

CLC Number: