mlpy.mdp.IMDPModel¶
-
class
mlpy.mdp.
IMDPModel
(proba_calc_method=None)[source]¶ Bases:
mlpy.modules.UniqueModule
The Markov decision process interface.
All Markov decision process (MDP) models are derived from the base class. The base class maintains an initial probability distribution from which the initial state can be sampled.
Parameters: proba_calc_method : str
The method used to calculate the probability distribution for the initial state. Defaults to DefaultProbaCalcMethod.
Attributes
mid
The module’s unique identifier. Methods
fit
(obs, actions, **kwargs)Fit the model to the observations and actions of the trajectory. init
()Initialize the MDP model. load
(filename)Load the state of the module from file. predict_proba
(state, action)Predict the probability distribution. sample
([state, action])Sample from the probability distribution. save
(filename)Save the current state of the module to file. update
(experience)Update the model with the agent’s experience.