Markov programming with policy constraints

European Journal of Operational Research - Volume 3 - Pages 253-255 - 1979
N.A.J. Hastings¹
¹ Monash University, Melbourne, Australia
