Skip to content

hayashimasa/Robust_MuZero

Repository files navigation

Robust MuZero

A robust variant of MuZero

Authors: Masahiro Hayashi, Bakr Ouairem

Supervisors: Edouard Leurent, Odalric-Ambrym Maillard

Overview

In order to propose a robust variant of MuZero, we first have to make the state transitions stochastic; to do so, we incorporate the work from PlaNet; namely, we integrate the SSM and RSSM model into the dynamics function of MuZero. We also modify MCTS so that the agent plans conservatively and robustly. So far, we haven't built a model that converges to any meaningful result.

About

A robust variant of MuZero

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •  

Languages