-
Notifications
You must be signed in to change notification settings - Fork 94
feat: implement the sliding tile puzzle env #189
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: implement the sliding tile puzzle env #189
Conversation
sash-a
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This looks really great! Couple minor changes and we got to wait and see how well it performs 🔥
clement-bonnet
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you for the PR! Left a few comments on a first review :)
|
What's the status of this PR? Anything I could help with? |
sash-a
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sorry this took so long, but awesome work @ElshadaiK I think it's pretty much ready just some minor comments from my side
clement-bonnet
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM overall, still investigating why training doesn't work.
0284549 to
72af3fe
Compare
sash-a
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just some small things we need to add here. Also need to add this mkdocs.yaml and add the gif
Co-authored-by: Sasha <reallysasha@gmail.com>
Co-authored-by: Sasha <reallysasha@gmail.com>
Co-authored-by: Sasha <reallysasha@gmail.com>
Co-authored-by: Clément Bonnet <56230714+clement-bonnet@users.noreply.github.com>
Co-authored-by: Sasha <reallysasha@gmail.com>
Co-authored-by: Sasha <reallysasha@gmail.com>
Co-authored-by: Sasha <reallysasha@gmail.com>
5acc203 to
00d6980
Compare
Co-authored-by: Sasha <reallysasha@gmail.com>
clement-bonnet
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Details:
Implements the full
SlidingTilePuzzleenvironment with actor-critic and random networks as well as documentation.Notes:
Gifs are still to be updated. Training of an a2c agent is ongoing