mlpy.mdp.stateaction.StateActionInfo
class mlpy.mdp.stateaction.StateActionInfo
Bases: object
The model interface.
Contains all relevant information predicted by a model for a given state-action pair. This includes the (predicted) reward and transition probabilities to possible next states.
Attributes
transition_proba (ProbabilityDistribution): The transition probability distribution.
reward_func (RewardFunction): The reward function.
visits (int): The number of times the state-action pair has been visited.
known (bool): Flag indicating whether a reward value is known or not.
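To illustrate how a planner might consume these attributes, here is a minimal sketch. It is not mlpy's implementation: `StateActionInfoSketch` is a hypothetical stand-in class, a plain dict replaces `ProbabilityDistribution`, a plain float replaces `RewardFunction`, and the `expected_next_value` helper and the discount factor of 0.5 are assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import Dict, Hashable


@dataclass
class StateActionInfoSketch:
    """Hypothetical stand-in mirroring the documented attributes (not mlpy's class)."""

    # Assumption: a dict mapping next state -> probability stands in for
    # mlpy's ProbabilityDistribution.
    transition_proba: Dict[Hashable, float] = field(default_factory=dict)
    # Assumption: a single float stands in for mlpy's RewardFunction.
    reward: float = 0.0
    # Number of times the state-action pair has been visited.
    visits: int = 0
    # Whether a reward value is known for this pair.
    known: bool = False

    def expected_next_value(self, values: Dict[Hashable, float]) -> float:
        """Probability-weighted value of the possible next states."""
        return sum(p * values.get(s, 0.0) for s, p in self.transition_proba.items())


# Fill in the model's prediction for one state-action pair.
info = StateActionInfoSketch(
    transition_proba={"s1": 0.75, "s2": 0.25},
    reward=1.0,
    visits=4,
    known=True,
)

# One-step backup: predicted reward plus discounted expected next-state value.
values = {"s1": 2.0, "s2": 4.0}
gamma = 0.5  # assumed discount factor
backup = info.reward + gamma * info.expected_next_value(values)
```

Grouping the predicted reward, the transition probabilities, and the visit bookkeeping in one object lets a planner read everything it needs for a state-action pair from a single lookup.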