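Deadly Triad

This idea comes from the RL book (Sutton and Barto, "Reinforcement Learning: An Introduction"). Learning can become unstable or diverge when a method combines all three of the following:

1. Function approximation
2. Bootstrapping
3. Off-policy training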
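A minimal sketch of how the three ingredients interact, assuming a two-state setup in the style of the "w, 2w" example from the same book: the first state has value estimate w, the second has 2w, and the only transition ever updated goes from the first to the second with reward 0. The function name `semi_gradient_td0`, the step size, the discount, and the `bootstrap` flag are illustrative choices, not part of these notes.

```python
def semi_gradient_td0(steps, gamma=0.9, alpha=0.1, w=1.0, bootstrap=True):
    """Repeatedly apply a semi-gradient update on the single transition s1 -> s2.

    Function approximation: v(s1) = 1 * w and v(s2) = 2 * w share one weight.
    Off-policy: only the s1 -> s2 transition is updated, so s2's own value
    is never corrected by updates from s2.
    Bootstrapping: the target uses the current estimate of v(s2).
    """
    x1, x2 = 1.0, 2.0   # features of s1 and s2
    reward = 0.0
    for _ in range(steps):
        if bootstrap:
            # TD(0) target built from the current estimate of the next state
            target = reward + gamma * x2 * w
        else:
            # Assumption for illustration: all rewards are zero, so the
            # true return from s1 is 0 and no estimate is needed
            target = reward
        w += alpha * (target - x1 * w) * x1
    return w


if __name__ == "__main__":
    print("all three present:", semi_gradient_td0(100))
    print("no bootstrapping: ", semi_gradient_td0(100, bootstrap=False))
    print("small discount:   ", semi_gradient_td0(100, gamma=0.4))
```

With bootstrapping on, each update multiplies w by 1 + alpha * (2 * gamma - 1), so the weight grows without bound whenever gamma > 0.5. Removing the bootstrapped target makes the same loop shrink w toward 0, which is the true value under the zero-reward assumption.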