Keyframe-Focused Visual Imitation Learning

Oct 8, 8080·

Chuan Wen

Jierui Lin

Jianing Qian

Yang Gao

Dinesh Jayaraman

· 0 min read

PDF Cite arXiv Webpage Code

Abstract

Imitation learning trains control policies by mimicking pre-recorded expert demonstrations. In partially observable settings, imitation policies must rely on observation histories, but many seemingly paradoxical results show better performance for policies that only access the most recent observation. Recent solutions ranging from causal graph learning to deep information bottlenecks have shown promising results, but failed to scale to realistic settings such as visual imitation. We propose a solution that outperforms these prior approaches by upweighting demonstration keyframes corresponding to expert action changepoints. This simple approach easily scales to complex visual imitation settings. Our experimental results demonstrate consistent performance improvements over all baselines on image-based Gym MuJoCo continuous control tasks. Finally, on the CARLA photorealistic vision-based urban driving simulator, we resolve a long-standing issue in behavioral cloning for driving by demonstrating effective imitation from observation histories.

Type

Publication

In ICML

Last updated on Oct 8, 8080

Imitation Learning Causality Reinforcement Learning Distributional Shift

← Prospective Learning: Back to the Future Jan 1, 1010

Femtomolar SARS-CoV-2 Antigen Detection Using the Microbubbling Digital Assay with Smartphone Readout Enables Antigen Burden Quantitation and Dynamics Tracking Sep 1, 1010 →