Model-Based Control
Once you do policy evaluation, you can do control.
You have two methods to improve your policy:
#todo This is in Lecture 5: Model-Free Control, but I don’t remember seeing this, so you need to revisit Lecture 3.
Greedy Policy Improvement over V(s) requires model of MDP.