mlpy.mdp.discrete.LeastVisitedBonusExplorer.activate

LeastVisitedBonusExplorer.activate(qvalues=None, *args, **kwargs)

Turn on exploration mode.

The agent enters exploration mode when it is predicted that only states with rewards below the threshold can be reached.
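
The activation condition can be illustrated with a minimal standalone sketch. This is not the mlpy implementation: the function name `should_explore` and its parameters `reachable_rewards` and `threshold` are hypothetical, chosen only to show the comparison described above. It assumes the predicted rewards of the reachable states are already available as plain numbers.

```python
from typing import Iterable


def should_explore(reachable_rewards: Iterable[float], threshold: float) -> bool:
    """Hypothetical helper, not part of mlpy.

    Returns True when every predicted reward among the reachable states
    is strictly below the threshold, i.e. nothing worthwhile is in reach.
    An empty collection also yields True (an assumption of this sketch).
    """
    return all(reward < threshold for reward in reachable_rewards)


# All predicted rewards are below the threshold, so exploration would turn on.
print(should_explore([0.1, 0.2], threshold=0.5))  # True

# One reachable state meets the threshold, so exploration would stay off.
print(should_explore([0.1, 0.9], threshold=0.5))  # False
```

The strict comparison mirrors the wording "less than the threshold"; a reward exactly equal to the threshold does not trigger exploration in this sketch.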