VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training

There are many papers in this line of work on implicit reward learning: