Notation

One of the things that I find super annoying is the inconsistency of notations across different sources.

For example, in Reinforcement Learning, learning Value Iteration from CS287 and David Silver produce really different results.

Ahh, but I think I get the notation difference reasoning, because in CS287, that notation is closer to the implementation in code, which our Source of Truth.