🛠️ Steven Gong

Search

Aug 29, 2025, 1 min read

Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning

Builds on top of Conservative Q-Learning.

I watched Sergey’s talk at RLC2024, where he talks about there’s a big gap with offline-RL and online-RL, and Cal-QL tries to close this gap.

Graph View

Backlinks

Offline Reinforcement Learning
Policy Extraction
Conservative Q-Learning for Offline Reinforcement Learning

Created with Quartz, © 2026

Blog
LinkedIn
Twitter
GitHub