mlpy.mdp.discrete.LeastVisitedBonusExplorer

class mlpy.mdp.discrete.LeastVisitedBonusExplorer(rmax, func, thresh=None)[source]

Bases: mlpy.mdp.discrete.RMaxExplorer

Least visited bonus explorer, a RMax based exploration model.

Least visited bonus exploration only goes into exploration mode whether it is predicted that only states with rewards less than a given threshold can be reached. Once in exploration mode, states that have been visited least are given a bonus of RMax to drive exploration.

Parameters:

rmax : float

The maximum achievable reward.

func : callable

Callback function to retrieve the minimum number of times a state has been visited.

thresh : float

If all states that can be reached from the current state have a value less than the threshold, exploration mode is turned on.

Attributes

mid The module’s unique identifier.

Methods

activate([qvalues]) Turn on exploration mode.
deactivate() Turn off exploration mode.
load(filename) Load the state of the module from file.
save(filename) Save the current state of the module to file.
update(model) Update the reward model.