mlpy.mdp.discrete.UnknownBonusExplorer
class mlpy.mdp.discrete.UnknownBonusExplorer(rmax)
Bases: mlpy.mdp.discrete.RMaxExplorer
Unknown bonus explorer, an RMax-based exploration model.
States for which the decision tree was unable to predict a reward are given a bonus of RMax to drive exploration, since these states are considered to be unknown under the model.
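The bonus rule described above can be illustrated with a minimal, standalone sketch. It does not use mlpy and is not the library's implementation: the function `apply_unknown_bonus`, the dictionary layout, and the use of `None` to mark a state without a reward prediction are all assumptions made for illustration.

```python
from typing import Dict, Hashable, Optional


def apply_unknown_bonus(
    predicted: Dict[Hashable, Optional[float]], rmax: float
) -> Dict[Hashable, float]:
    """Return a reward table in which every unpredicted state gets rmax.

    Hypothetical helper: `predicted` maps each state to the model's reward
    prediction, or to None when the model could not predict a reward.
    """
    return {
        state: (rmax if reward is None else reward)
        for state, reward in predicted.items()
    }


# Hypothetical model output; None marks "no prediction available".
predicted_rewards = {"s0": 0.2, "s1": None, "s2": -1.0}

rewards = apply_unknown_bonus(predicted_rewards, rmax=10.0)
print(rewards["s1"])  # 10.0, the unknown state receives the RMax bonus
print(rewards["s0"])  # 0.2, known predictions are left unchanged
```

Because unknown states look maximally rewarding, a planner that acts greedily on such a table is drawn toward them, which is the exploration effect the class description refers to.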
Parameters: rmax : float
The maximum achievable reward.
Attributes
mid
    The module’s unique identifier.
Methods
activate(*args, **kwargs)
    Turn on exploration mode.
deactivate()
    Turn off exploration mode.
load(filename)
    Load the state of the module from file.
save(filename)
    Save the current state of the module to file.
update(model)
    Update the reward model.