Markov decision process (MDP) (`mlpy.mdp`)

Transition and reward models

`MDPModelFactory`	The Markov decision process (MDP) model factory.
`IMDPModel`	The Markov decision process interface.

`DiscreteModel`	The MDP model for discrete states and actions.
`DecisionTreeModel`	The MDP model for discrete states and actions realized with decision trees.

`ExplorerFactory`	The model explorer factory.
`RMaxExplorer`	RMax-based exploration base class.
`LeastVisitedBonusExplorer`	Least visited bonus explorer, an RMax-based exploration model.
`UnknownBonusExplorer`	Unknown bonus explorer, an RMax-based exploration model.
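
The explorers above are described as RMax-based. As a rough illustration of the general R-max idea only, the sketch below treats a state-action pair as "unknown" until it has been visited a set number of times and reports the maximum reward for it in the meantime, which pushes a planner toward unexplored pairs. The class `UnknownBonusSketch` and its parameters `rmax` and `known_threshold` are hypothetical names chosen for this example; this is not the `mlpy.mdp` API and may differ from how `UnknownBonusExplorer` is implemented.

```python
class UnknownBonusSketch:
    """Hypothetical stand-in illustrating R-max style optimism.

    Not part of mlpy; names and behavior are assumptions for illustration.
    """

    def __init__(self, rmax, known_threshold):
        self.rmax = rmax                        # optimistic reward for unknown pairs
        self.known_threshold = known_threshold  # visits needed before a pair is "known"
        self.visits = {}                        # (state, action) -> visit count

    def record(self, state, action):
        """Count one visit to the given state-action pair."""
        key = (state, action)
        self.visits[key] = self.visits.get(key, 0) + 1

    def is_known(self, state, action):
        """A pair is known once it has been visited often enough."""
        return self.visits.get((state, action), 0) >= self.known_threshold

    def reward(self, state, action, observed_reward):
        """Return the observed reward for known pairs, rmax otherwise."""
        if self.is_known(state, action):
            return observed_reward
        return self.rmax


explorer = UnknownBonusSketch(rmax=10.0, known_threshold=2)
before = explorer.reward("s0", "a0", 1.0)   # pair not visited yet
explorer.record("s0", "a0")
explorer.record("s0", "a0")
after = explorer.reward("s0", "a0", 1.0)    # pair visited twice
```

With a threshold of two visits, `before` is the optimistic value and `after` is the observed reward, so the incentive to explore a pair disappears once it counts as known.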

`casml`	Continuous Action and State Model Learner (CASML).

`ProbaCalcMethodFactory`	The probability calculation method factory.
`IProbaCalcMethod`	The probability calculation method interface.
`DefaultProbaCalcMethod`	The default probability calculation method.
`ProbabilityDistribution`	Probability distribution.

`Experience`	Experience base class.
`RewardFunction`	The reward function.
`MDPStateActionInfo`	The models interface.
`MDPStateData`	State information interface.
`MDPPrimitive`	A Markov decision process primitive.
`MDPState`	Representation of the state.
`MDPAction`	Representation of an action.