Sampling (Reinforcement Learning)

Sampling: the update estimates an expectation from samples. A sample backup updates the value estimate from one sampled successor or trajectory, rather than exhaustively averaging over all possible successors before updating.

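  • DP does not sample: it does a full backup over every successor, using the model
  • MC samples: it backs up from complete sampled episodes
  • TD samples: it backs up from a single sampled transition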
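
The contrast between a full backup and a sample backup can be sketched on a toy one-step problem. Everything here is an assumption made for illustration: the transition table, the successor values in V, the discount GAMMA, and the step size alpha are all hypothetical. The full backup uses the model to sum over every successor, as DP does; the sampled version nudges its estimate toward one sampled target at a time, in the style of TD(0). An MC backup would sample in the same way but use a full sampled return as the target.

```python
import random

# Hypothetical model: from state "s" the environment moves to
# "a" (reward 1.0) with probability 0.8, or to "b" (reward 0.0) with probability 0.2.
TRANSITIONS = {"a": (0.8, 1.0), "b": (0.2, 0.0)}  # next_state: (prob, reward)
GAMMA = 0.9                  # assumed discount factor
V = {"a": 2.0, "b": 0.0}     # assumed current values of the successor states


def full_backup():
    """DP-style backup: exact expectation over all successors (needs the model)."""
    return sum(p * (r + GAMMA * V[s2]) for s2, (p, r) in TRANSITIONS.items())


def sample_transition(rng):
    """Draw a single successor and reward, as an agent would by acting."""
    states = list(TRANSITIONS)
    weights = [TRANSITIONS[s][0] for s in states]
    s2 = rng.choices(states, weights=weights, k=1)[0]
    return s2, TRANSITIONS[s2][1]


def td_sample_backups(n_steps, alpha, rng):
    """TD(0)-style backups: each update uses one sampled transition."""
    v_s = 0.0
    for _ in range(n_steps):
        s2, r = sample_transition(rng)
        target = r + GAMMA * V[s2]       # sampled one-step target
        v_s += alpha * (target - v_s)    # move the estimate toward the sample
    return v_s


exact = full_backup()
estimate = td_sample_backups(n_steps=5000, alpha=0.01, rng=random.Random(0))
print(f"full backup: {exact:.3f}, sampled estimate: {estimate:.3f}")
```

With a small step size the sampled estimate hovers near the full-backup value without ever enumerating the successors, which is why sampling matters when the model is unknown or the successor set is too large to sweep.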