🛠️ Steven Gong
Search
Search
Search
Light mode
Dark mode
Aug 16, 2025, 1 min read
Advantage Function
A
(
s
,
a
)
=
Q
(
s
,
a
)
−
V
(
s
)
Graph View
Backlinks
High-Dimensional Continuous Control Using Generalized Advantage Estimation