🛠️ Steven Gong

Aug 16, 2025, 1 min read

Advantage Function

$A(s, a) = Q(s, a) - V(s)$
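
A minimal tabular sketch of the formula, assuming the standard identity $V(s) = \sum_a \pi(a \mid s) \, Q(s, a)$, which this note does not state. The arrays `Q` and `pi` and the helper `advantage` are made-up illustrations, not taken from any particular library or paper.

```python
import numpy as np

# Hypothetical Q-table: 2 states (rows) x 3 actions (columns).
Q = np.array([[1.0, 2.0, 3.0],
              [0.5, 0.5, 2.0]])

# Assumed stochastic policy pi(a|s); each row sums to 1.
pi = np.array([[0.20, 0.30, 0.50],
               [0.25, 0.25, 0.50]])


def advantage(q: np.ndarray, policy: np.ndarray) -> np.ndarray:
    """Return A(s, a) = Q(s, a) - V(s) for every state-action pair."""
    # V(s) is the policy-weighted average of Q(s, a) over actions.
    v = (policy * q).sum(axis=1, keepdims=True)
    # Broadcasting subtracts each state's value from its row of Q.
    return q - v


A = advantage(Q, pi)
print(A)
```

Under these assumptions a positive entry means the action is better than the policy's average behaviour in that state, and the policy-weighted mean of each row of `A` is zero.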

Backlinks

High-Dimensional Continuous Control Using Generalized Advantage Estimation
