mlpy.mdp.discrete.UnknownBonusExplorer¶

class mlpy.mdp.discrete.UnknownBonusExplorer(rmax)[source]¶

Bases: mlpy.mdp.discrete.RMaxExplorer

Unknown bonus explorer, a RMax based exploration model.

States for which the decision tree was unable to predict a reward are given a bonus of RMax to drive exploration, since these states are considered to be unknown under the model.

Parameters:

rmax : float

The maximum achievable reward.

Attributes

mid The module’s unique identifier.

Methods

`activate`(args, *kwargs)	Turn on exploration mode.
`deactivate`()	Turn off exploration mode.
`load`(filename)	Load the state of the module from file.
`save`(filename)	Save the current state of the module to file.
`update`(model)	Update the reward model.