🛠️ Steven Gong

Search

SearchSearch
  • Entropy-Regularized Reinforcement Learning
  • Related

Jul 24, 2025, 1 min read

Reinforcement Learning

Entropy-Regularized Reinforcement Learning

π∗=argmaxπ​Eτ∼π​∑t=0∞​γt(R(st​,at​,st+1​)+αH(π(⋅∣st​))),

Related

  • n-step Reinforcement Learning

Graph View

Backlinks

  • No backlinks found

Created with Quartz, © 2025

  • Blog
  • LinkedIn
  • Twitter
  • GitHub