Deadly Triad

This is idea from RL book.

The 3 deadly things:

  1. Function Approximation
  2. Bootstrapping
  3. Off-policy training