Google Scholar

User profiles for Tim Hertweck

Tim Hertweck

Google DeepMind

Verified email at google.com

Cited by 335


The challenges of exploration for offline reinforcement learning

…, A Byravan, M Bloesch, V Dasagi, T Hertweck… - arXiv preprint arXiv …, 2022 - arxiv.org

Offline Reinforcement Learning (ORL) enables us to separately study the two interlinked
processes of reinforcement learning: collecting informative experience and inferring optimal …

Cited by 46


Data-efficient hindsight off-policy option learning

…, T Lampe, A Abdolmaleki, T Hertweck… - International …, 2021 - proceedings.mlr.press

We introduce Hindsight Off-policy Options (HO2), a data-efficient option learning algorithm.
Given any trajectory, HO2 infers likely option choices and backpropagates through the …

Cited by 53


Compositional transfer in hierarchical reinforcement learning

…, JT Springenberg, M Neunert, T Hertweck… - arXiv preprint arXiv …, 2019 - arxiv.org

The successful application of general reinforcement learning algorithms to real-world robotics
applications is often limited by their high data requirements. We introduce Regularized …

Cited by 42


Towards general and autonomous learning of core skills: A case study in locomotion

R Hafner, T Hertweck, P Klöppner… - … on Robot Learning, 2021 - proceedings.mlr.press

Modern Reinforcement Learning (RL) algorithms promise to solve difficult motor control
problems directly from raw sensory inputs. Their attraction is due in part to the fact that they can …

Cited by 35


Is curiosity all you need? on the utility of emergent behaviours from curious exploration

…, M Wulfmeier, G Vezzani, V Dasagi, T Hertweck… - arXiv preprint arXiv …, 2021 - arxiv.org

Curiosity-based reward schemes can present powerful exploration mechanisms which
facilitate the discovery of solutions for complex, sparse or long-horizon tasks. However, as the …

Cited by 25


Mastering stacking of diverse shapes with large-scale iterative reinforcement learning on real robots

…, O Groth, R Hafner, T Hertweck… - … on Robotics and …, 2024 - ieeexplore.ieee.org

Reinforcement learning solely from an agent’s self-generated data is often believed to be
infeasible for learning on real robots, due to the amount of data needed. However, if done right, …

Cited by 6


Simultaneously learning vision and feature-based control policies for real-world ball-in-a-cup

…, M Neunert, A Abdolmaleki, T Hertweck… - arXiv preprint arXiv …, 2019 - arxiv.org

We present a method for fast training of vision based control policies on real robots. The key
idea behind our method is to perform multi-task Reinforcement Learning with auxiliary tasks …

Cited by 31


Replay across experiments: A natural extension of off-policy RL

…, S Huang, G Lever, B Moran, T Hertweck… - arXiv preprint arXiv …, 2023 - arxiv.org

Replaying data is a principal mechanism underlying the stability and data efficiency of off-policy
reinforcement learning (RL). We present an effective yet simple framework to extend the …

Cited by 6


Less is more--the Dispatcher/Executor principle for multi-task Reinforcement Learning

M Riedmiller, T Hertweck, R Hafner - arXiv preprint arXiv:2312.09120, 2023 - arxiv.org

Humans instinctively know how to neglect details when it comes to solve complex decision
making problems in environments with unforeseeable variations. This abstraction process …

Cited by 2


Regularized hierarchical policies for compositional transfer in robotics

…, JT Springenberg, M Neunert, T Hertweck… - arXiv preprint arXiv …, 2019 - academia.edu

The successful application of flexible, general learning algorithms—such as deep
reinforcement learning—to real-world robotics applications is often limited by their poor data-efficiency…

Cited by 28
