Maximum Entropy
This is actually super useful and practical, because the world is full of Uncertainty.
The Entropy is given by
This is another way to formulate. To take into account uncertainty, so for Robustness.
We use Constrained Optimization to come up with a set of equations.
This is a really important derivation
ahh you want to maximize entropy so that the policy is not as deterministic
Max-entropy Value Iteration