mlpy.learners.offline.irl.IncrApprenticeshipLearner.learn

IncrApprenticeshipLearner.learn()[source]

Learn a policy from the experience.

Learn the optimal policy using an apprenticeship learning algorithm incrementally.

Returns:

bool :

Whether the found policy is considered to have converged. The algorithm is considered to have converged on the optimal policy if either the performance is within a certain threshold or if the maximum number of iterations has been reached.