User profiles for Daniel Toyama

Daniel Toyama

Google DeepMind
Verified email at google.com
Cited by 18467

Androidworld: A dynamic benchmarking environment for autonomous agents

…, W Li, F Campbell-Ajala, D Toyama… - International …, 2025 - proceedings.iclr.cc
Autonomous agents that execute human tasks by controlling computers can enhance human
productivity and application accessibility. However, progress in this field will be driven by …

The option keyboard: Combining skills in reinforcement learning

…, E Aygün, P Hamel, D Toyama… - Advances in …, 2019 - proceedings.neurips.cc
The ability to combine known skills to create new ones may be crucial in the solution of
complex reinforcement learning problems that unfold over extended periods. We argue that a …

Not all llm reasoners are created equal

A Hosseini, A Sordoni, D Toyama, A Courville… - arXiv preprint arXiv …, 2024 - arxiv.org
We study the depth of grade-school math (GSM) problem-solving capabilities of LLMs. To
this end, we evaluate their performance on pairs of existing math word problems together so …

Alphastar unplugged: Large-scale offline reinforcement learning

…, J Schrittwieser, D Choi, P Georgiev, D Toyama… - arXiv preprint arXiv …, 2023 - arxiv.org
StarCraft II is one of the most challenging simulated reinforcement learning environments; it
is partially observable, stochastic, multi-agent, and mastering StarCraft II requires strategic …

Starcraft ii unplugged: Large scale offline reinforcement learning

…, D Choi, P Georgiev, DK Toyama… - Deep RL Workshop …, 2021 - openreview.net
StarCraft II is one of the most challenging reinforcement learning (RL) environments; it is
partially observable, stochastic, and multi-agent, and mastering StarCraft II requires strategic …

Finding increasingly large extremal graphs with alphazero and tabu search

…, A Lee, A Ruoss, A Bulanova, D Toyama… - arXiv preprint arXiv …, 2023 - arxiv.org
This work studies a central extremal graph theory problem inspired by a 1975 conjecture of
Erd\H{o}s, which aims to find graphs with a given size (number of nodes) that maximize the …

Knowledge representation for reinforcement learning using general value functions

G Comanici, D Precup, A Barreto, DK Toyama, E Aygün… - 2018 - openreview.net
Reinforcement learning (RL) is a very powerful approach for learning good control strategies
from data. Value functions are a key concept for reinforcement learning, as they guide the …

Performance, Sex, and Phenotypic Evolution in Insular and Continental Anole Lizards

K Toyama - 2024 - utoronto.scholaris.ca
… have obviously been influenced by one or the other, so at this point I think is fair to say that
this work, and the past and the future ones, also belong to Daniel Toyama and Sara Campos. …

Apoptotic Force and Tissue Dynamics During Drosophila Embryogenesis

Y Toyama, XG Peralta, AR Wells, DP Kiehart… - Science, 2008 - science.org
Understanding cell morphogenesis during metazoan development requires knowledge of
how cells and the extracellular matrix produce and respond to forces. We investigated how …

[PDF][PDF] A NOTE ON n-CAYLEY GRAPHS: EXPANDER FAMILIES AND GALOIS COVERINGS

N Toyama - 2024 - math.u-ryukyu.ac.jp
A graph Γ is called an n-Cayley graph over a group G if there exists a semiregular subgroup
of Aut (Γ) that is isomorphic to G with n orbits (of equal size). This is one of the …