mlpy.learners.online.rl.QLearner.step¶

QLearner.step(experience)[source]¶

Execute learning specific updates.

Learning specific updates are performed, e.g. model updates.

Parameters:

experience : Experience

The actor’s current experience consisting of previous state, the action performed in that state, the current state, and the reward awarded.