default search action
Sebastian Flennerhag
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2023
- [c13]Robert Tjarko Lange, Tom Schaul, Yutian Chen, Tom Zahavy, Valentin Dalibard, Chris Lu, Satinder Singh, Sebastian Flennerhag:
Discovering Evolution Strategies via Meta-Black-Box Optimization. GECCO Companion 2023: 29-30 - [c12]Robert Tjarko Lange, Tom Schaul, Yutian Chen, Chris Lu, Tom Zahavy, Valentin Dalibard, Sebastian Flennerhag:
Discovering Attention-Based Genetic Algorithms via Meta-Black-Box Optimization. GECCO 2023: 929-937 - [c11]Robert Tjarko Lange, Tom Schaul, Yutian Chen, Tom Zahavy, Valentin Dalibard, Chris Lu, Satinder Singh, Sebastian Flennerhag:
Discovering Evolution Strategies via Meta-Black-Box Optimization. ICLR 2023 - [c10]Tom Zahavy, Yannick Schroecker, Feryal M. P. Behbahani, Kate Baumli, Sebastian Flennerhag, Shaobo Hou, Satinder Singh:
Discovering Policies with DOMiNO: Diversity Optimization Maintaining Near Optimality. ICLR 2023 - [c9]Ted Moskovitz, Brendan O'Donoghue, Vivek Veeriah, Sebastian Flennerhag, Satinder Singh, Tom Zahavy:
ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs. ICML 2023: 25303-25336 - [c8]Sebastian Flennerhag, Tom Zahavy, Brendan O'Donoghue, Hado Philip van Hasselt, András György, Satinder Singh:
Optimistic Meta-Gradients. NeurIPS 2023 - [i18]Sebastian Flennerhag, Tom Zahavy, Brendan O'Donoghue, Hado van Hasselt, András György, Satinder Singh:
Optimistic Meta-Gradients. CoRR abs/2301.03236 (2023) - [i17]Ted Moskovitz, Brendan O'Donoghue, Vivek Veeriah, Sebastian Flennerhag, Satinder Singh, Tom Zahavy:
ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs. CoRR abs/2302.01275 (2023) - [i16]Robert Tjarko Lange, Tom Schaul, Yutian Chen, Chris Lu, Tom Zahavy, Valentin Dalibard, Sebastian Flennerhag:
Discovering Attention-Based Genetic Algorithms via Meta-Black-Box Optimization. CoRR abs/2304.03995 (2023) - [i15]Veronica Chelu, Tom Zahavy, Arthur Guez, Doina Precup, Sebastian Flennerhag:
Optimism and Adaptivity in Policy Optimization. CoRR abs/2306.10587 (2023) - [i14]Kate Baumli, Satinder Baveja, Feryal M. P. Behbahani, Harris Chan, Gheorghe Comanici, Sebastian Flennerhag, Maxime Gazeau, Kristian Holsheimer, Dan Horgan, Michael Laskin, Clare Lyle, Hussain Masoom, Kay McKinney, Volodymyr Mnih, Alexander Neitz, Fabio Pardo, Jack Parker-Holder, John Quan, Tim Rocktäschel, Himanshu Sahni, Tom Schaul, Yannick Schroecker, Stephen Spencer, Richie Steigerwald, Luyu Wang, Lei Zhang:
Vision-Language Models as a Source of Rewards. CoRR abs/2312.09187 (2023) - 2022
- [c7]Louis Kirsch, Sebastian Flennerhag, Hado van Hasselt, Abram L. Friesen, Junhyuk Oh, Yutian Chen:
Introducing Symmetries to Black Box Meta Reinforcement Learning. AAAI 2022: 7202-7210 - [c6]Jelena Luketina, Sebastian Flennerhag, Yannick Schroecker, David Abel, Tom Zahavy, Satinder Singh:
Meta-Gradients in Non-Stationary Environments. CoLLAs 2022: 886-901 - [c5]Andrei Alex Rusu, Sebastian Flennerhag, Dushyant Rao, Razvan Pascanu, Raia Hadsell:
Probing Transfer in Deep Reinforcement Learning without Task Engineering. CoLLAs 2022: 1231-1254 - [c4]Sebastian Flennerhag, Yannick Schroecker, Tom Zahavy, Hado van Hasselt, David Silver, Satinder Singh:
Bootstrapped Meta-Learning. ICLR 2022 - [i13]Tom Zahavy, Yannick Schroecker, Feryal M. P. Behbahani, Kate Baumli, Sebastian Flennerhag, Shaobo Hou, Satinder Singh:
Discovering Policies with DOMiNO: Diversity Optimization Maintaining Near Optimality. CoRR abs/2205.13521 (2022) - [i12]Jelena Luketina, Sebastian Flennerhag, Yannick Schroecker, David Abel, Tom Zahavy, Satinder Singh:
Meta-Gradients in Non-Stationary Environments. CoRR abs/2209.06159 (2022) - [i11]Andrei A. Rusu, Sebastian Flennerhag, Dushyant Rao, Razvan Pascanu, Raia Hadsell:
Probing Transfer in Deep Reinforcement Learning without Task Engineering. CoRR abs/2210.12448 (2022) - [i10]Robert Tjarko Lange, Tom Schaul, Yutian Chen, Tom Zahavy, Valentin Dallibard, Chris Lu, Satinder Singh, Sebastian Flennerhag:
Discovering Evolution Strategies via Meta-Black-Box Optimization. CoRR abs/2211.11260 (2022) - 2021
- [i9]Tom Zahavy, Brendan O'Donoghue, André Barreto, Volodymyr Mnih, Sebastian Flennerhag, Satinder Singh:
Discovering Diverse Nearly Optimal Policies withSuccessor Features. CoRR abs/2106.00669 (2021) - [i8]Sebastian Flennerhag, Yannick Schroecker, Tom Zahavy, Hado van Hasselt, David Silver, Satinder Singh:
Bootstrapped Meta-Learning. CoRR abs/2109.04504 (2021) - [i7]Louis Kirsch, Sebastian Flennerhag, Hado van Hasselt, Abram L. Friesen, Junhyuk Oh, Yutian Chen:
Introducing Symmetries to Black Box Meta Reinforcement Learning. CoRR abs/2109.10781 (2021) - 2020
- [c3]Sebastian Flennerhag, Andrei A. Rusu, Razvan Pascanu, Francesco Visin, Hujun Yin, Raia Hadsell:
Meta-Learning with Warped Gradient Descent. ICLR 2020 - [i6]Adriano S. Koshiyama, Sebastian Flennerhag, Stefano B. Blumberg, Nick Firoozye, Philip C. Treleaven:
QuantNet: Transferring Learning Across Systematic Trading Strategies. CoRR abs/2004.03445 (2020) - [i5]Sebastian Flennerhag, Jane X. Wang, Pablo Sprechmann, Francesco Visin, Alexandre Galashov, Steven Kapturowski, Diana L. Borsa, Nicolas Heess, André Barreto, Razvan Pascanu:
Temporal Difference Uncertainties as a Signal for Exploration. CoRR abs/2010.02255 (2020)
2010 – 2019
- 2019
- [c2]Sebastian Flennerhag, Pablo Garcia Moreno, Neil D. Lawrence, Andreas C. Damianou:
Transferring Knowledge across Learning Processes. ICLR 2019 - [i4]Konstantin Klemmer, Adriano S. Koshiyama, Sebastian Flennerhag:
Augmenting correlation structures in spatial data using deep generative models. CoRR abs/1905.09796 (2019) - [i3]Sebastian Flennerhag, Andrei A. Rusu, Razvan Pascanu, Hujun Yin, Raia Hadsell:
Meta-Learning with Warped Gradient Descent. CoRR abs/1909.00025 (2019) - 2018
- [c1]Sebastian Flennerhag, Hujun Yin, John A. Keane, Mark J. Elliot:
Breaking the Activation Function Bottleneck through Adaptive Parameterization. NeurIPS 2018: 7750-7761 - [i2]Sebastian Flennerhag, Hujun Yin, John A. Keane, Mark J. Elliot:
Breaking the Activation Function Bottleneck through Adaptive Parameterization. CoRR abs/1805.08574 (2018) - [i1]Sebastian Flennerhag, Pablo Garcia Moreno, Neil D. Lawrence, Andreas C. Damianou:
Transferring Knowledge across Learning Processes. CoRR abs/1812.01054 (2018)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 22:08 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint