mlpy.learners.online.rl.RLLearner.learn

RLLearner.learn(experience=None)

Learn a policy from the experience.

Parameters:

experience : Experience

The agent’s experience, consisting of the previous state, the action performed in that state, the current state, and the reward awarded for that transition.
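
The following is a minimal, self-contained sketch of how a `learn(experience=None)` method might consume such an experience. It does not use mlpy and should not be read as mlpy's implementation: the `Experience` namedtuple, its field names, the `ToyQLearner` class, and the tabular Q-learning update are all assumptions chosen for illustration. Treating `experience=None` as a no-op is likewise an assumption.

```python
from collections import defaultdict, namedtuple

# Hypothetical stand-in for an experience record; the field names are
# assumptions, not mlpy's actual Experience attributes.
Experience = namedtuple("Experience", ["state", "action", "next_state", "reward"])


class ToyQLearner:
    """Illustrative tabular Q-learner (not mlpy's RLLearner)."""

    def __init__(self, actions, alpha=0.5, gamma=0.9):
        self.actions = list(actions)
        self.alpha = alpha            # learning rate
        self.gamma = gamma            # discount factor
        self.q = defaultdict(float)   # Q-values keyed by (state, action)

    def learn(self, experience=None):
        # Assumption: with no experience there is nothing to update.
        if experience is None:
            return
        s, a, s2, r = experience
        # One-step Q-learning target: reward plus discounted best next value.
        best_next = max(self.q[(s2, b)] for b in self.actions)
        target = r + self.gamma * best_next
        self.q[(s, a)] += self.alpha * (target - self.q[(s, a)])


learner = ToyQLearner(actions=["left", "right"])
learner.learn(Experience(state=0, action="right", next_state=1, reward=1.0))
learner.learn()  # no experience supplied; the table is left unchanged
```

The policy implied by this sketch is the greedy one over `learner.q`; a real learner may represent and update its policy differently.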