mlpy.mdp.discrete.LeastVisitedBonusExplorer.update¶

LeastVisitedBonusExplorer.update(model)[source]¶

Update the reward model.

Update the reward model according to a RMax based exploration policy. To drive exploration a bonus of RMax is given to the least visited states.