mlpy.planners.discrete.ValueIteration.get_next_action¶

ValueIteration.get_next_action(state, use_policy=False)¶

Returns the optimal action for a state according to the current policy.

Parameters:

state : State

The state for which to choose the next action for.

use_policy : bool, optional

When using a policy the next action is chosen according to the current policy, otherwise the best action is selected. Default is False.

Returns:

Action :

The next action.