🛠️ Steven Gong

Search

Jul 21, 2025, 1 min read

Greedy in the Limit with Infinite Exploration (GLIE)

Greedy in the Limit with Infinite Exploration (GLIE)

All state-action pairs are explored infinitely many times, $lim_{k \to \infty} N_{k} (s, a) = \infty$

The policy converges on a greedy policy, $lim_{n \to \infty} π_{k} (a ∣ s) = 1 (a = a^{'} \in A argmax Q_{k} (s, a^{'}))$

I initially undersold how important this is, but this is EXTREMELY important to understand.

We use this GLIE idea for Monte-Carlo Control.

Graph View

Backlinks

Bellman Equation
Epsilon-Greedy
Model-Free Control
Monte-Carlo Control

Created with Quartz, © 2026

Blog
LinkedIn
Twitter
GitHub