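Bellman Error

$$L(\phi, \mathcal{D}) = \underset{(s,a,r,s',d) \sim \mathcal{D}}{\mathrm{E}} \left[ \left( Q_\phi(s,a) - \left( r + \gamma (1-d) \max_{a'} Q_\phi(s',a') \right) \right)^2 \right]$$

https://spinningup.openai.com/en/latest/algorithms/ddpg.html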
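
To make the formula concrete, here is a minimal sketch of the loss for a small tabular case. It assumes a discrete action set, so the max over a′ can be taken by enumeration, and it approximates the expectation with a sample mean over a batch. The names `msbe_loss`, `q_table`, and `batch` are hypothetical and chosen only for illustration.

```python
def msbe_loss(q, batch, gamma=0.99):
    """Sample-mean estimate of the squared Bellman error.

    q     -- hypothetical callable: q(s) returns a list of Q-values, one per action
    batch -- iterable of transitions (s, a, r, s_next, d), with d = 1.0 if s_next is terminal
    """
    total = 0.0
    n = 0
    for s, a, r, s_next, d in batch:
        # Target: r + gamma * (1 - d) * max_a' Q(s', a'); the bootstrap term vanishes when d = 1.
        target = r + gamma * (1.0 - d) * max(q(s_next))
        total += (q(s)[a] - target) ** 2
        n += 1
    return total / n


# Toy Q-table (assumed values) with four states and two actions.
q_table = {0: [1.0, 2.0], 1: [0.0, 3.0], 2: [0.5, 0.0], 3: [4.0, 1.0]}
batch = [
    (0, 1, 1.0, 1, 0.0),  # non-terminal: target = 1.0 + 0.5 * 3.0 = 2.5
    (2, 0, 0.0, 3, 1.0),  # terminal:     target = 0.0
]
loss = msbe_loss(lambda s: q_table[s], batch, gamma=0.5)
print(loss)  # 0.25
```

Both transitions are off by 0.5 from their targets, so the mean squared error is 0.25. With continuous actions the max cannot be enumerated this way, so this sketch would not apply directly.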