🛠️ Steven Gong
Search
Search
Search
Light mode
Dark mode
Feb 07, 2026, 1 min read
Advantage Function
A
(
s
,
a
)
=
Q
(
s
,
a
)
−
V
(
s
)
Graph View
Backlinks
High-Dimensional Continuous Control Using Generalized Advantage Estimation