Two-way-Deconfounder

The source code for the paper ‘Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement Learning,’ which has been accepted for publication at NeurIPS 2024, is available in this repository

Run the Code

Part 1: Generate simulation datasets

python sim_toy.py --d_seed 11 --d_number 1000 --e_degree 1.0 --c_degree 1.0
python sim_tumor.py --d_seed 11 --d_number 1000 --e_degree 1.0 --c_degree 1.0

Part 2: Generate the true value of the target policy using Monte Carlo methods

python MCTrue_toy.py --d_seed 11 --d_number 1000 --e_degree 1.0 --c_degree 1.0 --MC 10000
python MCTrue_tumor.py --d_seed 11 --d_number 1000 --e_degree 1.0 --c_degree 1.0 --MC 10000

Part 3: train model

python tune_toy.py --d_seed 11 --d_number 1000 --e_degree 1.0 --c_degree 1.0 --method TWD
python tune_tumor.py --d_seed 11 --d_number 1000 --e_degree 1.0 --c_degree 1.0 --method TWD

Part 3: Generate the estimated value of the target policy using the above trained model

python toy_eval.py --d_seed 11 --d_number 1000 --e_degree 1.0 --c_degree 1.0 --method TWD
python tumor_eval.py --d_seed 11 --d_number 1000 --e_degree 1.0 --c_degree 1.0 --method TWD

Contact

I will continue to update the code over the next few days. please contract 24121534R@connect.polyu.hk if you have any questions

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
config		config
tables/toy		tables/toy
toymodels		toymodels
tumormodels		tumormodels
utils		utils
MCTrue_toy.py		MCTrue_toy.py
MCTrue_tumor.py		MCTrue_tumor.py
README.md		README.md
sim_toy.py		sim_toy.py
sim_tumor.py		sim_tumor.py
toy_eval.py		toy_eval.py
tumor_eval.py		tumor_eval.py
tune_toy.py		tune_toy.py
tune_tumor.py		tune_tumor.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Two-way-Deconfounder

Run the Code

Part 1: Generate simulation datasets

Part 2: Generate the true value of the target policy using Monte Carlo methods

Part 3: train model

Part 3: Generate the estimated value of the target policy using the above trained model

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Two-way-Deconfounder

Run the Code

Part 1: Generate simulation datasets

Part 2: Generate the true value of the target policy using Monte Carlo methods

Part 3: train model

Part 3: Generate the estimated value of the target policy using the above trained model

Contact

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages