mlpy.mdp.discrete.LeastVisitedBonusExplorer.activate
LeastVisitedBonusExplorer.activate(qvalues=None, *args, **kwargs)
Turn on exploration mode.
The agent enters exploration mode when it predicts that every reachable state has a reward below the threshold.
Parameters: qvalues : dict
The Q-values for all actions from the current state.
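
The activation rule described above can be illustrated with a minimal sketch. This is not mlpy's implementation: the helper name `should_explore`, the explicit `threshold` argument, and the choice to return `False` when no Q-values are supplied are all assumptions made for illustration. It also assumes that the Q-values themselves serve as the predicted rewards, so exploration is triggered only when even the best action falls below the threshold.

```python
from typing import Hashable, Mapping, Optional


def should_explore(
    qvalues: Optional[Mapping[Hashable, float]],
    threshold: float,
) -> bool:
    """Hypothetical helper (not part of mlpy).

    Returns True when every action's Q-value is below ``threshold``,
    i.e. when the best predicted outcome is still below the threshold.
    """
    # Assumption: with no Q-values there is no prediction to act on,
    # so this sketch does not trigger exploration.
    if not qvalues:
        return False

    # If the largest Q-value is below the threshold, all of them are.
    return max(qvalues.values()) < threshold


# Q-values for each action from the current state (made-up numbers).
qvalues = {"left": 0.1, "right": 0.3}

print(should_explore(qvalues, threshold=0.5))  # True: best value 0.3 < 0.5
print(should_explore(qvalues, threshold=0.2))  # False: 0.3 >= 0.2
```

Comparing only the maximum is enough here because the condition must hold for all actions at once; how the library actually derives its prediction and threshold should be checked against the linked source.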