mlpy.mdp.continuous.casml.CASML

class mlpy.mdp.continuous.casml.CASML(cbtdata, cbvdata=None, ncomponents=1, proba_calc_method=None, actions=None, n_init=1, **kwargs)[source]

Bases: mlpy.mdp.IMDPModel

Continuous Action and State Model Learner (CASML).

Parameters:

cbtdata : CbTData

The transition case base data.

cbvdata : CbVData

The value case base data.

ncomponents : int, optional

Number of states of the hidden Markov model. Default is 1.

proba_calc_method : str, optional

The method used to calculate the probability distribution for the initial states. Default is DefaultProbaCalcMethod.

actions : list or dict, optional

A list of possible discrete actions.

Attributes

cases
experience
mid The module’s unique identifier.

Methods

fit(obs, actions[, rewards, n_init]) Fit the CaseBase and the HMM.
get_actions([state]) Retrieve the available actions for the given state.
init() Initialize the MDP model.
load(filename) Load the state of the module from file.
predict_proba(state, action) Predict the probability distribution.
retrieve(case_base, state, action[, ...])
sample([state, action]) Sample from the probability distribution.
save(filename) Save the current state of the module to file.
update(experience) Update the model with the agent’s experience.