High-Dimensional Continuous Control Using Generalized Advantage Estimation
Heard from the spinning up
- “For a more detailed treatment of this topic, you should read the paper on Generalized Advantage Estimation (GAE), which goes into depth about different choices of in the background sections.”
- https://spinningup.openai.com/en/latest/spinningup/rl_intro3.html#id16
There are a few that we could choose:
- where is a Reward-to-go
- This is know as the Advantage Function