mlpy.mdp.discrete.RMaxExplorer¶

class mlpy.mdp.discrete.RMaxExplorer(rmax)[source]¶

RMax based exploration base class.

Parameters:

rmax : float

The maximum achievable reward.

Attributes

mid The module’s unique identifier.

Methods

`activate`(args, *kwargs)	Turn on exploration mode.
`deactivate`()	Turn off exploration mode.
`load`(filename)	Load the state of the module from file.
`save`(filename)	Save the current state of the module to file.
`update`(model)	Update the reward model according to a RMax based exploration policy.