mlpy.learners.online.rl.QLearner.choose_action

QLearner.choose_action(state)

Choose the next action.

The next action is chosen according to the current policy and the selected exploration strategy.

Parameters:

state : MDPState

The current state.

Returns:

MDPAction :

The chosen action.
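
Example:

The page does not say which exploration strategy is used or how the policy is stored, so the following is only a minimal sketch of one common combination: epsilon-greedy exploration over a table of Q-values. The function name `choose_action_sketch`, the `q_values` dictionary, and the `epsilon` parameter are assumptions made for illustration and are not part of mlpy's API; plain strings stand in for `MDPState` and `MDPAction` objects.

```python
import random


def choose_action_sketch(q_values, state, actions, epsilon=0.1, rng=random):
    """Hypothetical epsilon-greedy selection; NOT mlpy's implementation.

    q_values maps (state, action) pairs to estimated returns; pairs that
    are missing from the table are treated as having a value of 0.0.
    """
    # Explore: with probability epsilon, pick a uniformly random action.
    if rng.random() < epsilon:
        return rng.choice(actions)
    # Exploit: otherwise pick the action with the highest estimated value.
    return max(actions, key=lambda a: q_values.get((state, a), 0.0))


# Illustrative Q-table with two actions available in state "s0".
q = {("s0", "left"): 0.2, ("s0", "right"): 0.7}
greedy = choose_action_sketch(q, "s0", ["left", "right"], epsilon=0.0)
```

With `epsilon=0.0` the sketch never explores, so `greedy` is `"right"`, the action with the larger Q-value. Raising `epsilon` trades some of that exploitation for random exploration.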