Skip to content

Cannot reproduce results #24

@Nicolas99-9

Description

@Nicolas99-9

I tried to run mario_a2c.py, mario_ppo.py and mario_curio.py but for non of them I cannot improve the reward.
Did you use the same hyper-parameters as in the files to conduct the evaluation? (i.e. number of workers, learning rate)
Which version of the libraries did you use ?

For instance, A2C without ICM: (after 3M time-steps)

Screenshot from 2019-08-06 16-38-51

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions