🛠️ Steven Gong

Jul 21, 2025, 1 min read

Offline RL

Batch-Constrained Deep Q-Learning (BCQ)

https://arxiv.org/abs/1812.0290
