ReBeL

ReBeL essentially combines the idea of RL with the game theory CFR. MCCFR used a very tabular method, storing the regrets for each information set. But with Rebel, we use the idea of a function approximator to get the regrets.

Link to original Paper: https://arxiv.org/abs/2007.13544

Notation:

Over