Offline Reinforcement Learning

Offline RL uses previously collected data without any additional data collection.