NUDGE

This is the implementation of Neurally gUided Differentiable loGic policiEs (NUDGE), a framework for logic RL agents based on differentiable forward reasoning with first-order logic (FOL).

Quickstart

Install all requirements via
```
pip install -r requirements.txt
```
On project level, simply run python train.py to start a new training run.

How to Use

Training

To train a new agent, run python train.py. The training process is controlled by the hyperparameters specified in in/config/default.yaml. You can specify a different configuration by providing the corresponding YAML file path as an argument, e.g., python train.py -c in/config/my_config.yaml. The -c argument is optional and defaults to in/config/default.yaml. You can also overwrite the game environment by providing the -g argument, e.g., python train.py -g freeway.

Hyperparameters

The hyperparameters are configured inside in/config/default.yaml which is loaded as default. You can specify a different configuration by providing the corresponding YAML file path as an argument, e.g., python train.py in/config/my_config.yaml. A description of all hyperparameters can be found in train.py.

The Logic

Inside in/envs/[env_name]/logic/[ruleset_name]/, you find the logic rules that are used as a starting point for training. You can change them or create new rule sets. The ruleset to use is specified with the hyperparam rules.

Install Locally

If you want to use NUDGE within other projects, you can install NUDGE locally as follows:

Inside nsfr/ run
```
python setup.py develop
```
Inside nudge/ run
```
python setup.py develop
```

Optional: Threefish and Loot Environments

In case you want to use the Threefish or the Loot environment, you also need to install QT-5 via

apt-get install qt5-default

Playing

python3 play_gui.py -g seaquest

Environments and their Variants

Getout

getout contains key, door and one enemy.
getoutplus has one more enemy.

Threefish

threefish contains one bigger fish and one smaller fish.
threefishcolor contains one red fish and one green fish. agent need to avoid red fish and eat green fish.

Loot

loot contains 2 pairs of key and door.
lootcolor contains 2 pairs of key and door with different color than in loot.
lootplus contains 3 pairs of key and door.

How to Set up New Environments

You add a new environment inside in/envs/[new_env_name]/. There, you need to define a NudgeEnv class that wraps the original environment in order to do

logic state extraction: translates raw env states into logic representations
valuation: Each relation (like closeby) has a corresponding valuation function which maps the (logic) game state to a probability that the relation is true. Each valuation function is defined as a simple Python function. The function's name must match the name of the corresponding relation.
action mapping: action-predicates predicted by the agent need to be mapped to the actual env actions

See the freeway env to see how it is done.

Misc

TODO Using Beam Search to find a set of rules

With scoring:

python3 beam_search.py -m getout -r getout_root -t 3 -n 8 --scoring True -d getout.json

Without scoring:

python3 beam_search.py -m threefish -r threefishm_root -t 3 -n 8

--t: Number of rule expansion of clause generation.
--n: The size of the beam.
--scoring: To score the searched rules, a dataset of states information is required.
-d: The name of dataset to be used for scoring.

Name		Name	Last commit message	Last commit date
Latest commit History 66 Commits
docs		docs
env_src		env_src
in		in
nsfr		nsfr
nudge		nudge
results		results
scripts		scripts
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
play.py		play.py
play_gui.py		play_gui.py
requirements.txt		requirements.txt
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

NUDGE

Quickstart

How to Use

Training

Hyperparameters

The Logic

Install Locally

Optional: Threefish and Loot Environments

Playing

Environments and their Variants

Getout

Threefish

Loot

How to Set up New Environments

Misc

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 3

Uh oh!

Languages

License

k4ntz/NUDGE

Folders and files

Latest commit

History

Repository files navigation

NUDGE

Quickstart

How to Use

Training

Hyperparameters

The Logic

Install Locally

Optional: Threefish and Loot Environments

Playing

Environments and their Variants

Getout

Threefish

Loot

How to Set up New Environments

Misc

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 3

Uh oh!

Languages

Packages