mlpy.planners.discrete.ValueIteration.choose_action¶

ValueIteration.choose_action(state, use_policy=False)¶

Choose the optimal action for a state according to the current policy.

Parameters:

state : MDPState

The state for which to choose the next action for.

use_policy : bool, optional

When using a policy the next action is chosen according to the current policy, otherwise the best action is selected. Default is False.

Returns:

MDPAction :

The next action.