mlpy.learners.online.rl.QLearner.step¶
-
QLearner.
step
(experience)[source]¶ Execute learning specific updates.
Learning specific updates are performed, e.g. model updates.
Parameters: experience : Experience
The actor’s current experience consisting of previous state, the action performed in that state, the current state, and the reward awarded.