GitHub - JiahengHu/DUSDi: Official Implementation of DUSDi (NeurIPS24)

DUSDi: Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement Learning (NeurIPS 2024)

This codebase was modified based on URLB.

Requirements

We assume you have access to a GPU that can run CUDA 11.7. Then, the simplest way to install all required dependencies is to create an anaconda environment by running

conda env create -f conda_env.yml

After the installation ends, you can activate your environment with

conda activate dusdi

After the environment is activated, set up the PettingZoom repo:

git clone https://github.com/JiahengHu/Pettingzoo-skill.git
cd Pettingzoo-skill
pip install -e .

Install pytorch with cuda support:

pip install torch==2.0.0 torchvision==0.15.0 torchaudio==2.0.0 --index-url https://download.pytorch.org/whl/cu117

Instructions

Pre-training

To run pre-training use the pretrain.py script. For example:

python pretrain.py agent=dusdi_diayn domain=particle agent.skill_dim=5 env.particle.N=10 exp_nm="test"

The snapshots will be stored under the following directory:

./models/<obs_type>/<domain>/<agent>/

Downstream Hierarchical Learning

Once you have pre-trained your method, you can use the saved snapshots to learn downstream task. For example:

python train.py domain=particle ds_task=poison_l low_path="seed:2 particle dusdi_diayn test"

Checkout finetune.py for baselines that don't learn skills.

Monitoring

Logs are stored in the exp_local folder. To launch tensorboard run:

tensorboard --logdir exp_local

The console output is also available in a form:

| train | F: 6000 | S: 3000 | E: 6 | L: 1000 | R: 5.5177 | FPS: 96.7586 | T: 0:00:42

a training entry decodes as

F  : total number of environment frames
S  : total number of agent steps
E  : total number of episodes
R  : episode return
FPS: training throughput (frames per second)
T  : total training time

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
agent		agent
custom_env		custom_env
env		env
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
compute_dci.py		compute_dci.py
conda_env.yml		conda_env.yml
env_helper.py		env_helper.py
finetune.py		finetune.py
finetune.yaml		finetune.yaml
logger.py		logger.py
pretrain.py		pretrain.py
pretrain.yaml		pretrain.yaml
replay_buffer.py		replay_buffer.py
requirements.txt		requirements.txt
train.py		train.py
train.yaml		train.yaml
utils.py		utils.py
video.py		video.py
wrapper.py		wrapper.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DUSDi: Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement Learning (NeurIPS 2024)

Requirements

Instructions

Pre-training

Downstream Hierarchical Learning

Monitoring

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

DUSDi: Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement Learning (NeurIPS 2024)

Requirements

Instructions

Pre-training

Downstream Hierarchical Learning

Monitoring

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages