VDPO: Variational Delayed Policy Optimization

guidelines

1. requirement

conda create -n VDPO python=3.10
conda activate VDPO
pip install -r requirement.yaml
pip install gymnasium[mujoco]

2. run the VDPO

python3 VDPO.py --env=Ant-v4 --delay=5

Citation

@article{wu2024variational,
  title={Variational Delayed Policy Optimization},
  author={Wu, Qingyuan and Zhan, Simon Sinong and Wang, Yixuan and Wang, Yuhui and Lin, Chung-Wei and Lv, Chen and Zhu, Qi and Huang, Chao},
  booktitle={38th Conference on Neural Information Processing Systems},
  year={2024}
}

Acknowledgement

CleanRL: https://github.com/vwxyzjn/cleanrl
SAC: https://github.com/haarnoja/sac
AD-RL: https://github.com/QingyuanWuNothing/AD-RL

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
VDPO.py		VDPO.py
belief.py		belief.py
make_env.py		make_env.py
nn.py		nn.py
requirement.yaml		requirement.yaml
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VDPO: Variational Delayed Policy Optimization

guidelines

1. requirement

2. run the VDPO

Citation

Acknowledgement

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

VDPO: Variational Delayed Policy Optimization

guidelines

1. requirement

2. run the VDPO

Citation

Acknowledgement

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages