Linear Programming and Markov Decision Chains

Management Science - Tập 25 Số 4 - Trang 352-362 - 1979
Arie Hordijk1, L. C. M. Kallenberg1
1University of Leiden, The Netherlands

Tóm tắt

In this paper we show that for a finite Markov decision process an average optimal policy can be found by solving only one linear programming problem. Also the relation between the set of feasible solutions of the linear program and the set of stationary policies is analyzed.

Từ khóa


Tài liệu tham khảo