Skip to content

Conversation

@ElshadaiK
Copy link
Contributor

Details:
Implements the full SlidingTilePuzzle environment with actor-critic and random networks as well as documentation.

Notes:
Gifs are still to be updated. Training of an a2c agent is ongoing

Copy link
Collaborator

@sash-a sash-a left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This looks really great! Couple minor changes and we got to wait and see how well it performs 🔥

Copy link
Contributor

@clement-bonnet clement-bonnet left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you for the PR! Left a few comments on a first review :)

@carlosgmartin
Copy link

What's the status of this PR? Anything I could help with?

sash-a
sash-a previously approved these changes Jan 11, 2024
Copy link
Collaborator

@sash-a sash-a left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sorry this took so long, but awesome work @ElshadaiK I think it's pretty much ready just some minor comments from my side

Copy link
Contributor

@clement-bonnet clement-bonnet left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM overall, still investigating why training doesn't work.

@clement-bonnet clement-bonnet force-pushed the feat/add_sliding_tile_puzzle_environment branch from 0284549 to 72af3fe Compare March 12, 2024 14:27
Copy link
Collaborator

@sash-a sash-a left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just some small things we need to add here. Also need to add this mkdocs.yaml and add the gif

@clement-bonnet clement-bonnet force-pushed the feat/add_sliding_tile_puzzle_environment branch from 5acc203 to 00d6980 Compare March 13, 2024 17:40
clement-bonnet and others added 2 commits March 13, 2024 18:47
Co-authored-by: Sasha <reallysasha@gmail.com>
clement-bonnet
clement-bonnet previously approved these changes Mar 13, 2024
clement-bonnet
clement-bonnet previously approved these changes Mar 13, 2024
sash-a
sash-a previously approved these changes Mar 13, 2024
@clement-bonnet clement-bonnet dismissed stale reviews from sash-a and themself via b492701 March 13, 2024 20:21
clement-bonnet
clement-bonnet previously approved these changes Mar 13, 2024
Copy link
Contributor

@clement-bonnet clement-bonnet left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@djbyrne djbyrne merged commit a903c4f into instadeepai:main Mar 14, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants