User profiles for Daniel Toyama
Daniel ToyamaGoogle DeepMind Verified email at google.com Cited by 18467 |
Androidworld: A dynamic benchmarking environment for autonomous agents
Autonomous agents that execute human tasks by controlling computers can enhance human
productivity and application accessibility. However, progress in this field will be driven by …
productivity and application accessibility. However, progress in this field will be driven by …
The option keyboard: Combining skills in reinforcement learning
The ability to combine known skills to create new ones may be crucial in the solution of
complex reinforcement learning problems that unfold over extended periods. We argue that a …
complex reinforcement learning problems that unfold over extended periods. We argue that a …
Not all llm reasoners are created equal
We study the depth of grade-school math (GSM) problem-solving capabilities of LLMs. To
this end, we evaluate their performance on pairs of existing math word problems together so …
this end, we evaluate their performance on pairs of existing math word problems together so …
Alphastar unplugged: Large-scale offline reinforcement learning
StarCraft II is one of the most challenging simulated reinforcement learning environments; it
is partially observable, stochastic, multi-agent, and mastering StarCraft II requires strategic …
is partially observable, stochastic, multi-agent, and mastering StarCraft II requires strategic …
Starcraft ii unplugged: Large scale offline reinforcement learning
StarCraft II is one of the most challenging reinforcement learning (RL) environments; it is
partially observable, stochastic, and multi-agent, and mastering StarCraft II requires strategic …
partially observable, stochastic, and multi-agent, and mastering StarCraft II requires strategic …
Finding increasingly large extremal graphs with alphazero and tabu search
This work studies a central extremal graph theory problem inspired by a 1975 conjecture of
Erd\H{o}s, which aims to find graphs with a given size (number of nodes) that maximize the …
Erd\H{o}s, which aims to find graphs with a given size (number of nodes) that maximize the …
Knowledge representation for reinforcement learning using general value functions
Reinforcement learning (RL) is a very powerful approach for learning good control strategies
from data. Value functions are a key concept for reinforcement learning, as they guide the …
from data. Value functions are a key concept for reinforcement learning, as they guide the …
Performance, Sex, and Phenotypic Evolution in Insular and Continental Anole Lizards
K Toyama - 2024 - utoronto.scholaris.ca
… have obviously been influenced by one or the other, so at this point I think is fair to say that
this work, and the past and the future ones, also belong to Daniel Toyama and Sara Campos. …
this work, and the past and the future ones, also belong to Daniel Toyama and Sara Campos. …
Apoptotic Force and Tissue Dynamics During Drosophila Embryogenesis
Understanding cell morphogenesis during metazoan development requires knowledge of
how cells and the extracellular matrix produce and respond to forces. We investigated how …
how cells and the extracellular matrix produce and respond to forces. We investigated how …
[PDF][PDF] A NOTE ON n-CAYLEY GRAPHS: EXPANDER FAMILIES AND GALOIS COVERINGS
N Toyama - 2024 - math.u-ryukyu.ac.jp
A graph Γ is called an n-Cayley graph over a group G if there exists a semiregular subgroup
of Aut (Γ) that is isomorphic to G with n orbits (of equal size). This is one of the …
of Aut (Γ) that is isomorphic to G with n orbits (of equal size). This is one of the …