Policy Gradient Methods Deep Deterministic Policy Gradient (DDPG) More sample efficient. Resources Lecture 5: DDPG and SAC from Deep RL Foundations, slides here Related