Google Scholar

User profiles for Daniel Toyama

Daniel Toyama

Google DeepMind

Verified email at google.com

Cited by 18467

[PDF] iclr.cc

Androidworld: A dynamic benchmarking environment for autonomous agents

…, W Li, F Campbell-Ajala, D Toyama… - International …, 2025 - proceedings.iclr.cc

Autonomous agents that execute human tasks by controlling computers can enhance human
productivity and application accessibility. However, progress in this field will be driven by …

Save Cite Cited by 303 Related articles All 3 versions View as HTML

[PDF] neurips.cc

The option keyboard: Combining skills in reinforcement learning

…, E Aygün, P Hamel, D Toyama… - Advances in …, 2019 - proceedings.neurips.cc

The ability to combine known skills to create new ones may be crucial in the solution of
complex reinforcement learning problems that unfold over extended periods. We argue that a …

Save Cite Cited by 157 Related articles All 11 versions View as HTML

[PDF] arxiv.org

Not all llm reasoners are created equal

A Hosseini, A Sordoni, D Toyama, A Courville… - arXiv preprint arXiv …, 2024 - arxiv.org

We study the depth of grade-school math (GSM) problem-solving capabilities of LLMs. To
this end, we evaluate their performance on pairs of existing math word problems together so …

Save Cite Cited by 29 Related articles All 4 versions View as HTML

[PDF] arxiv.org

Alphastar unplugged: Large-scale offline reinforcement learning

…, J Schrittwieser, D Choi, P Georgiev, D Toyama… - arXiv preprint arXiv …, 2023 - arxiv.org

StarCraft II is one of the most challenging simulated reinforcement learning environments; it
is partially observable, stochastic, multi-agent, and mastering StarCraft II requires strategic …

Save Cite Cited by 38 Related articles All 2 versions View as HTML

[PDF] openreview.net

Starcraft ii unplugged: Large scale offline reinforcement learning

…, D Choi, P Georgiev, DK Toyama… - Deep RL Workshop …, 2021 - openreview.net

StarCraft II is one of the most challenging reinforcement learning (RL) environments; it is
partially observable, stochastic, and multi-agent, and mastering StarCraft II requires strategic …

Save Cite Cited by 35 Related articles View as HTML

[PDF] arxiv.org

Finding increasingly large extremal graphs with alphazero and tabu search

…, A Lee, A Ruoss, A Bulanova, D Toyama… - arXiv preprint arXiv …, 2023 - arxiv.org

This work studies a central extremal graph theory problem inspired by a 1975 conjecture of
Erd\H{o}s, which aims to find graphs with a given size (number of nodes) that maximize the …

Save Cite Cited by 21 Related articles All 8 versions View as HTML

[PDF] openreview.net

Knowledge representation for reinforcement learning using general value functions

G Comanici, D Precup, A Barreto, DK Toyama, E Aygün… - 2018 - openreview.net

Reinforcement learning (RL) is a very powerful approach for learning good control strategies
from data. Value functions are a key concept for reinforcement learning, as they guide the …

Save Cite Cited by 11 Related articles View as HTML

[PDF] scholaris.ca

Performance, Sex, and Phenotypic Evolution in Insular and Continental Anole Lizards

K Toyama - 2024 - utoronto.scholaris.ca

… have obviously been influenced by one or the other, so at this point I think is fair to say that
this work, and the past and the future ones, also belong to Daniel Toyama and Sara Campos. …

Save Cite Related articles

[PDF] nih.gov

Apoptotic Force and Tissue Dynamics During Drosophila Embryogenesis

Y Toyama, XG Peralta, AR Wells, DP Kiehart… - Science, 2008 - science.org

Understanding cell morphogenesis during metazoan development requires knowledge of
how cells and the extracellular matrix produce and respond to forces. We investigated how …

Save Cite Cited by 345 Related articles All 15 versions

[PDF] u-ryukyu.ac.jp

[PDF][PDF] A NOTE ON n-CAYLEY GRAPHS: EXPANDER FAMILIES AND GALOIS COVERINGS

N Toyama - 2024 - math.u-ryukyu.ac.jp

A graph Γ is called an n-Cayley graph over a group G if there exists a semiregular subgroup
of Aut (Γ) that is isomorphic to G with n orbits (of equal size). This is one of the …

Create alert

Cite

Advanced search

Saved to My library

User profiles for Daniel Toyama

Daniel Toyama

Androidworld: A dynamic benchmarking environment for autonomous agents

The option keyboard: Combining skills in reinforcement learning

Not all llm reasoners are created equal

Alphastar unplugged: Large-scale offline reinforcement learning

Starcraft ii unplugged: Large scale offline reinforcement learning

Finding increasingly large extremal graphs with alphazero and tabu search

Knowledge representation for reinforcement learning using general value functions

Performance, Sex, and Phenotypic Evolution in Insular and Continental Anole Lizards

Apoptotic Force and Tissue Dynamics During Drosophila Embryogenesis

[PDF][PDF] A NOTE ON n-CAYLEY GRAPHS: EXPANDER FAMILIES AND GALOIS COVERINGS