Markov Decision Process (MDP)
RL problems are usually modeled as MDPs.
S ? set of all states.
A? ? set of all actions in state ?.
Previous slide
Next slide
Back to first slide
View graphic version