mlpy.learners.online.rl.QLearner¶
-
class
mlpy.learners.online.rl.
QLearner
(explorer=None, max_steps=None, alpha=None, gamma=None, filename=None, profile=False)[source]¶ Bases:
mlpy.learners.online.rl.RLLearner
Performs q-learning.
Q-learning is a reinforcement learning variant.
Parameters: explorer : Explorer, optional
The exploration strategy used. Default is no exploration.
max_steps : int, optional
The maximum number of steps in an iteration. Default is 100
alpha : float, optional
The learning rate. Default is 0.5.
gamma : float, optional
The discounting factor. Default is 0.9.
filename : str, optional
The name of the file to save the learner state to after each iteration. If None is given, the learner state is not saved. Default is None.
profile : bool, optional
Turn on profiling at which point profiling data is collected and saved to a text file. Default is False.
Attributes
mid
The module’s unique identifier. type
This learner is of type online. Methods
choose_action
(state)Choose the next action execute
(experience)Execute learning specific updates. learn
([experience])Learn a policy from the experience. load
(filename)Load the state of the module from file. reset
(t, **kwargs)Reset reinforcement learner. save
(filename)Save the learners state.