Abstract
Let the state space S be a Borel subset of a complete separable metric space, the action space A compact metric. Existence of stationary optimal policies is proved for general semi-Markov models with possibly unbounded rewards. The corresponding dynamic programming equations are also derived. The paper presents a synthesis and extensions of earlier results.
| Original language | English (US) |
|---|---|
| Pages (from-to) | 365-381 |
| Number of pages | 17 |
| Journal | Journal of Statistical Planning and Inference |
| Volume | 21 |
| Issue number | 3 |
| DOIs | |
| State | Published - Mar 1989 |
| Externally published | Yes |
Keywords
- Stationary optimal policy
- discounted reward
- dynamic programming equation
- invariant distribution
ASJC Scopus subject areas
- Statistics and Probability
- Statistics, Probability and Uncertainty
- Applied Mathematics