mlpy.mdp.discrete.UnknownBonusExplorer

class mlpy.mdp.discrete.UnknownBonusExplorer(rmax)[source]

Bases: mlpy.mdp.discrete.RMaxExplorer

Unknown bonus explorer, a RMax based exploration model.

States for which the decision tree was unable to predict a reward are given a bonus of RMax to drive exploration, since these states are considered to be unknown under the model.

Parameters:

rmax : float

The maximum achievable reward.

Attributes

mid The module’s unique identifier.

Methods

activate(*args, **kwargs) Turn on exploration mode.
deactivate() Turn off exploration mode.
load(filename) Load the state of the module from file.
save(filename) Save the current state of the module to file.
update(model) Update the reward model.