mlpy.mdp.discrete.UnknownBonusExplorer¶
-
class
mlpy.mdp.discrete.UnknownBonusExplorer(rmax)[source]¶ Bases:
mlpy.mdp.discrete.RMaxExplorerUnknown bonus explorer, a RMax based exploration model.
States for which the decision tree was unable to predict a reward are given a bonus of RMax to drive exploration, since these states are considered to be unknown under the model.
Parameters: rmax : float
The maximum achievable reward.
Attributes
midThe module’s unique identifier. Methods
activate(*args, **kwargs)Turn on exploration mode. deactivate()Turn off exploration mode. load(filename)Load the state of the module from file. save(filename)Save the current state of the module to file. update(model)Update the reward model.