mlpy.learners.online.rl.QLearner¶
-
class
mlpy.learners.online.rl.QLearner(explorer=None, max_steps=None, alpha=None, gamma=None, filename=None, profile=False)[source]¶ Bases:
mlpy.learners.online.rl.RLLearnerPerforms q-learning.
Q-learning is a reinforcement learning variant.
Parameters: explorer : Explorer, optional
The exploration strategy used. Default is no exploration.
max_steps : int, optional
The maximum number of steps in an iteration. Default is 100
alpha : float, optional
The learning rate. Default is 0.5.
gamma : float, optional
The discounting factor. Default is 0.9.
filename : str, optional
The name of the file to save the learner state to after each iteration. If None is given, the learner state is not saved. Default is None.
profile : bool, optional
Turn on profiling at which point profiling data is collected and saved to a text file. Default is False.
Attributes
midThe module’s unique identifier. typeThis learner is of type online. Methods
choose_action(state)Choose the next action execute(experience)Execute learning specific updates. learn([experience])Learn a policy from the experience. load(filename)Load the state of the module from file. reset(t, **kwargs)Reset reinforcement learner. save(filename)Save the learners state.