Custom policy evaluations for Evaluation Callbacks.
Functions
evaluate_policy(model, env[, ...])
evaluate_policy
Runs policy for n_eval_episodes episodes and returns average reward.
n_eval_episodes