Monte-Carlo vs. Temporal Difference
David Silver talks about this Bias - Variance Tradeoff.
In MC, we have low bias, but high variance. In TD(0), we have high bias, but low variance.
TD() tries to combine the best out of both worlds.
David Silver talks about this Bias - Variance Tradeoff.
In MC, we have low bias, but high variance. In TD(0), we have high bias, but low variance.
TD() tries to combine the best out of both worlds.