mlpy.learners.online.rl.QLearner.learn¶

QLearner.learn(experience=None)[source]¶

Learn a policy from the experience.

By updating the Q table according to the experience a policy is learned.

Parameters:

experience : Experience

The actor’s current experience consisting of previous state, the action performed in that state, the current state, and the reward awarded.