Reinforcement Learning | Dinesh Jayaraman

VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training

Yecheng Jason Ma, Shagun Sodhani, Dinesh Jayaraman, Osbert Bastani, Vikash Kumar, Amy Zhang

Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning

Kun Huang, Edward Hu, Dinesh Jayaraman

How Far I'll Go: Offline Goal-Conditioned Reinforcement Learning via $ f $-Advantage Regression

Yecheng Jason Ma, Jason Yan, Dinesh Jayaraman, Osbert Bastani

SMODICE: Versatile Offline Imitation Learning via State Occupancy Matching

Yecheng Jason Ma, Andrew Shen, Dinesh Jayaraman, Osbert Bastani

Know Thyself: Transferable Visuomotor Control Through Robot-Awareness

Edward S. Hu, Kun Huang, Oleh Rybkin, Dinesh Jayaraman

Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning

Yecheng Jason Ma, Andrew Shen, Osbert Bastani, Dinesh Jayaraman

Prospective Learning: Back to the Future

Joshua T Vogelstein, Timothy Verstynen, Konrad P Kording, Leyla Isik, John W Krakauer, Ralph Etienne-Cummings, Elizabeth L Ogburn, Carey E Priebe, Randal Burns, Kwame Kutten, James J Knierim, James B Potash, Thomas Hartung, Lena Smirnova, Paul Worley, Alena Savonenko, Ian Phillips, Michael I Miller, Rene Vidal, Jeremias Sulam, Adam Charles, Noah J Cowan, Maxim Bichuch, Archana Venkataraman, Chen Li, Nitish Thakor, Justus M Kebschull, Marilyn Albert, Jinchong Xu, Marshall Hussain Shuler, Brian Caffo, Tilak Ratnanather, Ali Geisa, Seung-Eon Roh, Eva Yezerets, Meghana Madhyastha, Javier J How, Tyler M Tomita, Jayanta Dey, Ningyuan Huang, Jong M Shin, Kaleab Alemayehu Kinfu, Pratik Chaudhari, Ben Baker, Anna Schapiro, Dinesh Jayaraman, Eric Eaton, Michael Platt, Lyle Ungar, Leila Wehbe, Adam Kepecs, Amy Christensen, Onyema Osuagwu, Bing Brunton, Brett Mensh, Alysson R Muotri, Gabriel Silva, Francesca Puppo, Florian Engert, Elizabeth Hillman, Julia Brown, Chris White, Weiwei Yang

Keyframe-Focused Visual Imitation Learning

Identifying and upsampling important frames from demonstration data can significantly boost imitation learning from histories, and scales easily to complex settings such as autonomous driving from vision.

Chuan Wen, Jierui Lin, Jianing Qian, Yang Gao, Dinesh Jayaraman

Conservative Offline Distributional Reinforcement Learning

Yecheng Jason Ma, Dinesh Jayaraman, Osbert Bastani

How Are Learned Perception-Based Controllers Impacted by the Limits of Robust Control?

We show empirically that the sample complexity and asymptotic performance of learned non-linear controllers in partially observable settings continues to follow theoretical limits based on the difficulty of state estimation

Jingxi Xu, Bruce Lee, Nikolai Matni, Dinesh Jayaraman