mlpy.mdp.discrete.RMaxExplorer

class mlpy.mdp.discrete.RMaxExplorer(rmax)[source]

Bases: mlpy.modules.UniqueModule

RMax based exploration base class.

Parameters:

rmax : float

The maximum achievable reward.

Attributes

mid The module’s unique identifier.

Methods

activate(*args, **kwargs) Turn on exploration mode.
deactivate() Turn off exploration mode.
load(filename) Load the state of the module from file.
save(filename) Save the current state of the module to file.
update(model) Update the reward model according to a RMax based exploration policy.