Counterfactual Regret Minimization

Best Response

A player $i$'s best response is a strategy that maximizes their expected payoff, assuming all other players play according to a fixed strategy profile $\sigma$.

This is really important; we need to look at the paper "Accelerating Best Response Calculation in Large Extensive Games".

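Given a strategy profile $\sigma$, we define player $i$'s best response as

$$BR_i(\sigma_{-i}) = \arg\max_{\sigma_i' \in \Sigma_i} u_i(\sigma_i', \sigma_{-i}),$$

where $\sigma_{-i}$ denotes the strategies of all players other than $i$ in $\sigma$, $\Sigma_i$ is player $i$'s set of strategies, and $u_i$ is player $i$'s expected payoff. In other words, player $i$'s best response is a strategy that maximizes their expected payoff assuming all other players play according to $\sigma_{-i}$.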
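To make the definition concrete, here is a minimal sketch for the simplest setting: a two-player matrix (normal-form) game rather than an extensive game. The function name `best_response`, the rock-paper-scissors payoff table, and the opponent's mixed strategy are all assumptions chosen for illustration; none of them come from the paper mentioned above.

```python
from typing import List, Tuple


def best_response(payoffs: List[List[float]],
                  opponent_strategy: List[float]) -> Tuple[int, float]:
    """Return (action index, expected payoff) of a pure best response.

    payoffs[a][b] is the row player's payoff when they play action a
    and the opponent plays action b. opponent_strategy[b] is the
    probability that the opponent plays action b.
    """
    # Expected payoff of each of our pure actions against the fixed opponent.
    expected = [
        sum(p * u for p, u in zip(opponent_strategy, row))
        for row in payoffs
    ]
    # The arg max over our actions is a best response.
    best_action = max(range(len(expected)), key=expected.__getitem__)
    return best_action, expected[best_action]


# Hypothetical example: rock-paper-scissors from the row player's view.
# Rows and columns are ordered rock, paper, scissors.
RPS = [
    [0, -1, 1],
    [1, 0, -1],
    [-1, 1, 0],
]

# Assumed opponent strategy: rock 50%, paper 25%, scissors 25%.
action, value = best_response(RPS, [0.5, 0.25, 0.25])
print(action, value)  # prints "1 0.25": paper is the best response
```

Against an opponent who over-plays rock, the best response is paper, with an expected payoff of 0.25. In a large extensive game the same arg max has to be taken over whole strategies rather than single actions, which is presumably why computing it efficiently deserves a paper of its own.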

Personal Notes

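I remember reading somewhere that a best response can always be chosen to be deterministic: at least one pure strategy is always among the best responses. Mixed best responses can exist too, but only when several pure strategies tie for the maximum expected payoff.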
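A quick numerical sanity check of this note, again sketched for a matrix game with made-up numbers (the payoff table, the opponent strategy, and the helper names are assumptions for illustration). The expected payoff of a mixed strategy is a weighted average of the pure-action payoffs, so it can never exceed the best pure action:

```python
import random


def expected_payoffs(payoffs, opponent_strategy):
    """Expected payoff of each pure action against a fixed opponent."""
    return [
        sum(p * u for p, u in zip(opponent_strategy, row))
        for row in payoffs
    ]


def mixed_value(mixed, pure_values):
    """Expected payoff of a mixed strategy: a weighted average of pure values."""
    return sum(w * v for w, v in zip(mixed, pure_values))


# Hypothetical game: rock-paper-scissors, opponent over-plays rock.
payoffs = [
    [0, -1, 1],
    [1, 0, -1],
    [-1, 1, 0],
]
pure_values = expected_payoffs(payoffs, [0.5, 0.25, 0.25])
best_pure = max(pure_values)

# Sample random mixed strategies and check that none beats the best pure one.
rng = random.Random(0)
never_beaten = True
for _ in range(1000):
    weights = [rng.random() for _ in pure_values]
    total = sum(weights)
    mixed = [w / total for w in weights]
    if mixed_value(mixed, pure_values) > best_pure + 1e-12:
        never_beaten = False

print(best_pure, never_beaten)
```

This is only an empirical check, not a proof. The underlying argument is that a weighted average of numbers cannot be larger than the largest of them.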