🛠️ Steven Gong

Search

n-step Reinforcement Learning
Related

Feb 11, 2026, 1 min read

n-step Reinforcement Learning

Really had to deeply understand this as I started reading the Reinforcement Learning with Action Chunking paper, where they talked about bias.

https://gibberblot.github.io/rl-notes/single-agent/n-step.html

You can do n-step RL for lots of things. But in RL, we generally do 1-step RL.

Instead of $r + V

Not the same thing as td lambda!!!

TODO: look into the actual differences

Related

Bias - Variance Tradeoff
Bellman Update

Graph View

Backlinks

Bellman Equation
Entropy-Regularized Reinforcement Learning
Q-Learning
Temporal-Difference Learning (TD Learning)

Created with Quartz, © 2026

Blog
LinkedIn
Twitter
GitHub