A Minimalist Approach to Offline Reinforcement Learning (TD3 + BC)

This paper’s core contribution is just add the BC loss to DDPG, and shows great improvement, i.e.

Shown in the Is Value Learning Really the Main Bottleneck in Offline RL.