A Minimalist Approach to Offline Reinforcement Learning (TD3 + BC)
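This paper’s core contribution is simply to add a behavior cloning (BC) loss to the DDPG-style policy update used by TD3, which yields a large improvement on offline RL benchmarks.
The same DDPG + BC objective is also shown in "Is Value Learning Really the Main Bottleneck in Offline RL?".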
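The policy objective described above can be sketched as follows. This is a minimal pure-Python illustration, not the authors' implementation. The function name `td3_bc_actor_loss` and its arguments are hypothetical. The default `alpha=2.5` and the scaling of the Q term by `alpha / mean(|Q|)` follow the TD3+BC paper. The sketch assumes the Q-values passed in are the critic's outputs for the policy's actions, and that they are not all zero.

```python
def td3_bc_actor_loss(q_values, policy_actions, dataset_actions, alpha=2.5):
    """Sketch of a TD3+BC-style actor loss for one batch.

    q_values:        list of Q(s_i, pi(s_i)), one float per sample (assumed input)
    policy_actions:  list of action vectors pi(s_i) produced by the policy
    dataset_actions: list of action vectors a_i stored in the offline dataset
    alpha:           trade-off hyperparameter (2.5 in the TD3+BC paper)
    """
    n = len(q_values)

    # Normalize the Q term so its scale is comparable to the BC term.
    mean_abs_q = sum(abs(q) for q in q_values) / n
    lam = alpha / mean_abs_q

    # DDPG/TD3 part: maximize Q, i.e. minimize the negative mean Q.
    q_term = sum(q_values) / n

    # BC part: mean squared error between policy and dataset actions,
    # averaged over both the batch and the action dimensions.
    sq_errors = [
        (p - a) ** 2
        for pi_a, data_a in zip(policy_actions, dataset_actions)
        for p, a in zip(pi_a, data_a)
    ]
    bc_term = sum(sq_errors) / len(sq_errors)

    return -lam * q_term + bc_term


# Toy batch of two samples with 1-D actions (made-up numbers).
loss = td3_bc_actor_loss(
    q_values=[1.0, 3.0],
    policy_actions=[[0.0], [1.0]],
    dataset_actions=[[0.0], [0.0]],
)
```

In a real training loop the same expression would be built from differentiable tensors, with the normalizer `lam` treated as a constant (no gradient flows through it), so that only the Q term and the BC term drive the policy update.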