mlpy.mdp.discrete.LeastVisitedBonusExplorer.update

LeastVisitedBonusExplorer.update(model)[source]

Update the reward model.

Update the reward model according to a RMax based exploration policy. To drive exploration a bonus of RMax is given to the least visited states.

Parameters:

model : StateActionInfo

The states-action information.