mlpy.mdp.discrete.LeastVisitedBonusExplorer.activate

LeastVisitedBonusExplorer.activate(qvalues=None, *args, **kwargs)

Turn on exploration mode.

If it is predicted that only states with rewards below the threshold can be reached, the agent enters exploration mode.

Parameters:

qvalues : dict, optional

The Q-values for all actions from the current state.
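
The activation rule described above can be illustrated with a minimal sketch. This is not mlpy's implementation: the helper `should_explore`, its `threshold` argument, and the action names are hypothetical, and the sketch assumes that `qvalues` maps each action to a predicted value and that an empty or missing dict leaves exploration mode off.

```python
def should_explore(qvalues, threshold):
    """Hypothetical helper (not part of mlpy): decide whether to explore.

    Returns True when every predicted value in `qvalues` is below
    `threshold`, i.e. no action is expected to reach a rewarding state.
    """
    # Assumption: with no Q-values supplied there is nothing to judge,
    # so exploration mode is not turned on.
    if not qvalues:
        return False
    return all(q < threshold for q in qvalues.values())


# Illustrative Q-values for two made-up actions.
qvalues = {"left": 0.1, "right": 0.3}

print(should_explore(qvalues, threshold=0.5))  # True: every value is below 0.5
print(should_explore(qvalues, threshold=0.2))  # False: "right" reaches 0.3
```

Comparing every entry against the threshold mirrors the wording "only states with rewards less than the threshold can be reached": a single action at or above the threshold is enough to keep the agent out of exploration mode.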