Exploitability

If $\epsilon = 0$, then $\sigma$ is a Nash Equilibrium: no player has any incentive to deviate, because every player is already playing a Best Response to the others.

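If a game is two-player and zero-sum, we can use exploitability as a metric for how close a strategy profile $\sigma = (\sigma_1, \sigma_2)$ is to an equilibrium, where $\text{exploitability}(\sigma) = \max_{\sigma_1'} u_1(\sigma_1', \sigma_2) + \max_{\sigma_2'} u_2(\sigma_1, \sigma_2')$, i.e. the sum of what each player could earn by playing a Best Response to the other's strategy. It is zero exactly when $\sigma$ is a Nash Equilibrium and positive otherwise. We will use it to measure how close the strategy profile we generate is to the Nash Equilibrium.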
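
To make the metric concrete, here is a minimal sketch for a two-player zero-sum matrix game. It assumes the game is given as a payoff matrix for player 1 (player 2 receives the negative), and the names `exploitability`, `RPS`, `uniform`, and `always_rock` are illustrative, not taken from the text. Because a Best Response to a fixed mixed strategy can always be found among the pure actions, taking the maximum over pure actions is enough.

```python
def exploitability(payoff, row_strategy, col_strategy):
    """Sum of both players' best-response values in a zero-sum matrix game.

    payoff[i][j] is player 1's payoff when player 1 plays row i and
    player 2 plays column j; player 2 receives -payoff[i][j].
    """
    n_rows, n_cols = len(payoff), len(payoff[0])

    # Player 1's expected payoff for each pure row against col_strategy.
    row_values = [
        sum(payoff[i][j] * col_strategy[j] for j in range(n_cols))
        for i in range(n_rows)
    ]
    # Player 2's expected payoff for each pure column against row_strategy.
    col_values = [
        sum(-payoff[i][j] * row_strategy[i] for i in range(n_rows))
        for j in range(n_cols)
    ]
    # Each player's best-response value is the best pure action's value.
    return max(row_values) + max(col_values)


# Rock-paper-scissors (order: rock, paper, scissors), payoffs for player 1.
RPS = [
    [0, -1, 1],
    [1, 0, -1],
    [-1, 1, 0],
]
uniform = [1 / 3, 1 / 3, 1 / 3]
always_rock = [1.0, 0.0, 0.0]

print(exploitability(RPS, uniform, uniform))      # equilibrium profile
print(exploitability(RPS, always_rock, uniform))  # player 1 is exploitable
```

The uniform profile is the equilibrium of rock-paper-scissors, so its exploitability is zero up to floating-point error. When player 1 always plays rock, player 2 can gain by switching to paper, and the exploitability becomes positive.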