Entropy

Maximum Entropy

This is actually super useful and practical, because the world is full of Uncertainty.

The Entropy is given by

This is another way to formulate. To take into account uncertainty, so for Robustness.

We use Constrained Optimization to come up with a set of equations.

This is a really important derivation

ahh you want to maximize entropy so that the policy is not as deterministic

Max-entropy Value Iteration