A really simple example / `test run' of learning with our BHDEnv. Uses the "continous DQN" DDPG agent. Ideally, this should run without errors. But right now, it doesn't. See README.
This is an example package that provides expert reinforcement learning policies that can be run on the TriFinger robot cluster. You can use it as base for your own package when submitting to the robot ...
Deep reinforcement learning (DRL) is transitioning from a research field focused on game playing to a technology with real-world applications. Notable examples include DeepMind’s work on controlling a ...
Diagram of MURAL, our method for learning uncertainty-aware rewards for RL. After the user provides a few examples of desired outcomes, MURAL automatically infers a reward function that takes into ...