(2025). RICL: Adding In-Context Adaptability to Pre-Trained Vision-Language-Action Models. CORL.
(2025). VLMgineer: Vision-Language Models as Robotic Toolsmiths. arXiv preprint arXiv:2507.12644.
(2025). Real-World Reinforcement Learning of Interactive Perception Behaviors. NeurIPS.
(2025). Pixie: Fast and Generalizable Supervised Learning of 3D Physics from Pixels. arXiv preprint arXiv:2508.17437.
(2025). RoboArena: Distributed Real-World Evaluation of Generalist Robot Policies. CORL.
(2025). ZeroMimic: Distilling Robotic Manipulation Skills from Web Videos. ICRA.
(2025). Vision Language Models are In-Context Value Learners. ICLR.
(2025). The Value of Sensory Information to a Robot. ICLR.
(2025). REGENT: A Retrieval-Augmented Generalist Agent That Can Act In-Context in New Environments. ICLR.
(2025). Leveraging Symmetry to Accelerate Learning of Trajectory Tracking Controllers for Free-Flying Robotic Systems. ICRA.
(2025). Learning to Achieve Goals with Belief State Transformers. ICLR.
(2025). Illustrated Landmark Graphs for Long-Horizons Policy Learning. TMLR.
(2025). Articulate-Anything: Automatic Modeling of Articulated Objects via a Vision-Language Foundation Model. ICLR.
(2024). Task-Oriented Hierarchical Object Decomposition for Visuomotor Control . CORL.
(2024). Eurekaverse: Environment Curriculum Generation via Large Language Models. CORL (oral).
(2024). ZeroFlow: Fast Zero Label Scene Flow via Distillation. ICLR.
(2024). Universal Visual Decomposer: Long-Horizon Manipulation Made Easy. ICRA.
(2024). Training self-learning circuits for power-efficient solutions. Applied Physics Letters (APL) Machine Learning.
(2024). Privileged Sensing Scaffolds Reinforcement Learning. ICLR.
(2024). Open X-Embodiment: Robotic Learning Datasets and RT-X Models. ICRA.
(2024). Memory-Consistent Neural Networks for Imitation Learning. ICLR.
(2024). Long-HOT: A Modular Hierarchical Approach for Long-Horizon Object Transport. ICRA.
(2024). Eureka: Human-Level Reward Design via Coding Large Language Models. ICLR.
(2024). DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset. RSS.
(2024). DrEureka: Language Model Guided Sim-To-Real Transfer. RSS.
(2023). TLControl: Trajectory and Language Control for Human Motion Synthesis. arXiv.