Sampling (Reinforcement Learning)

Sampling: the update samples an expectation Doing a sample backup, rather than exhaustively going through all rollouts before updating.
- DP does not sample
- MC Samples
- TD Samples

Sampling: the update samples an expectation Doing a sample backup, rather than exhaustively going through all rollouts before updating.