![](http://science-h.com/sh/assets/temas/umad/img/tema/globe.png)
This paper presents a novel algorithm for learning in a class of stochastic Markov decision processes (MDPs) with continuous state and action spaces that trades speed for accuracy. A transform of the stochastic MDP into a deterministic one is presented…
|