mlpy.planners.discrete.ValueIteration.get_next_action¶
-
ValueIteration.
get_next_action
(state, use_policy=False)¶ Returns the optimal action for a state according to the current policy.
Parameters: state : State
The state for which to choose the next action for.
use_policy : bool, optional
When using a policy the next action is chosen according to the current policy, otherwise the best action is selected. Default is False.
Returns: Action :
The next action.