Reinforcement Learning

VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training
How Far I'll Go: Offline Goal-Conditioned Reinforcement Learning via $ f $-Advantage Regression
SMODICE: Versatile Offline Imitation Learning via State Occupancy Matching
Know Thyself: Transferable Visuomotor Control Through Robot-Awareness
Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning
Prospective Learning: Back to the Future
Conservative Offline Distributional Reinforcement Learning