Sampling (Reinforcement Learning)
Sampling: the update samples an expectation Doing a sample backup, rather than exhaustively going through all rollouts before updating.
- DP does not sample
- MC Samples
- TD Samples
Sampling: the update samples an expectation Doing a sample backup, rather than exhaustively going through all rollouts before updating.