🛠️ Steven Gong

Search

SearchSearch

Dec 21, 2022, 1 min read

EXP3 Algorithm

Hongrui Zheng talked about this, it is widely used in MAB problems.

https://courses.cs.washington.edu/courses/cse599s/14sp/scribes/lecture9/lecture9_draft.pdf

Graph View

Backlinks

  • FormulaZero
  • Multi-Armed Bandit (MAB)

Created with Quartz, © 2025

  • Blog
  • LinkedIn
  • Twitter
  • GitHub