It seems harder than ever to agree with others on basic facts, let alone to develop […]
Programs like AlphaZero and GPT-3 are massive accomplishments: they represent years of sustained work solving a […]
A couple of years ago, Pete Skomoroch, Roger Magoulas, and I talked about the problems of […]
Fig. 1: The BRIDGE dataset contains 7200 demonstrations of kitchen-themed manipulation tasks across 71 tasks in […]
Many experimental works have observed that generalization in deep RL appears to be difficult: although RL […]
An example of our method deployed on a Clearpath Jackal ground robot (left) exploring a suburban […]
Many experimental works have observed that generalization in deep RL appears to be difficult: although RL […]
Fig 1. Measures of generalization performance for neural networks trained on four different boolean functions (colors) […]
Diagram of MURAL, our method for learning uncertainty-aware rewards for RL. After the user provides a […]
Sequence Modeling Solutions for Reinforcement Learning Problems Long-horizon predictions of (top) the Trajectory Transformer compared to […]