Refactor act by alexander-soare · Pull Request #68 · huggingface/lerobot

alexander-soare · 2024-04-08T08:00:09Z

Notes for reviewers:

I suggest primarily reading lerobot/common/policies/act.py from scratch (not in diff mode), as it's almost a complete rewrite.
There are some TODO(now) items that need to be resolved before the remove torchrl work is merged into the main branch.
Keep in mind while reviewing that parts of the refactor are still outstanding, but they are being deferred to near-future PRs:
- Decoupling neural network optimization from the policy class.
- Splitting the update method up (https://github.com/orgs/huggingface/projects/46/views/1?pane=issue&itemId=57095229)
- A way to have model configs as dictionaries or dataclasses (and removing omegaconf objects from the core code).
You should be able to use DATA_DIR=data python lerobot/scripts/eval.py --hub-id lerobot/act_aloha_transfer_cube_human-original_repo eval_episodes=1 rollout_batch_size=1 to evaluate the ported weights from the original repo.
So far, I've verified that a model trained on sim_transfer_cube_human can match/surpass the original weights.

Still left to do

Reproduce original repo's eval score with original repo's weights (with torchrl)
Reproduce original repo's eval score with original repo's weights (using new Aloha Env)
- ~~Make sure this is reproducible with scripts/configs.~~ Upload converted weights, conversion script, converted stats, stats conversion script, to hub.
Train models for one human and one sim dataset, reproducing original results.
- sim_transfer_cube_human
- sim_insertion_scripted

Train on LeRobot with:

export DATA_DIR=data

python lerobot/scripts/train.py \
    hydra.job.name=act_aloha_sim_insertion_scripted \
    env=aloha \
    env.task=sim_insertion \
    dataset_id=aloha_sim_insertion_scripted \
    policy=act \
    log_freq=50 \
    eval_freq=2500 \
    rollout_batch_size=20 \
    eval_episodes=20 \
    policy.grad_clip_norm=100 \
    policy.use_vae=true \
    horizon=100 \
    wandb.enable=true \
    hydra.run.dir=outputs/train/act_aloha_sim_insertion_scripted \
    device=cuda \
    offline_steps=80000 \
    prefetch=4 \
    save_model=true \
    save_freq=5000


Cadene · 2024-04-08T12:33:36Z

+        x = self.multihead_attn(
+            query=self.maybe_add_pos_embed(x, decoder_pos_embed),
+            key=self.maybe_add_pos_embed(encoder_out, encoder_pos_embed),
+            value=encoder_out,
+        )[0]


Why [0] ?
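For context on the question above: assuming `self.multihead_attn` is a `torch.nn.MultiheadAttention` (the diff does not show how it is constructed), its forward pass returns a tuple of the attended output and the attention weights, so `[0]` keeps only the output. A minimal sketch with hypothetical sizes:

```python
import torch
from torch import nn

# Hypothetical sizes chosen only for illustration; they are not taken from the PR.
embed_dim, num_heads = 8, 2
attn = nn.MultiheadAttention(embed_dim, num_heads)

# With the default batch_first=False, inputs are (sequence, batch, embed_dim).
decoder_tokens = torch.randn(3, 1, embed_dim)
encoder_tokens = torch.randn(5, 1, embed_dim)

result = attn(query=decoder_tokens, key=encoder_tokens, value=encoder_tokens)
attn_output, attn_weights = result  # forward() returns a 2-tuple

# Indexing with [0] keeps the attended features and drops the weights.
x = result[0]
```

If the weights are never used, unpacking into `x, _ = ...` is an equivalent spelling that makes the discarded value explicit.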

Cadene · 2024-04-08T12:33:44Z

+        if self.normalize_before:
+            x = self.norm1(x)
+        q = k = self.maybe_add_pos_embed(x, decoder_pos_embed)
+        x = self.self_attn(q, k, value=x)[0]


Cadene · 2024-04-08T12:34:27Z

+        Returns:
+            A (1, C, H, W) batch of corresponding sinusoidal positional embeddings.
+        """
+        not_mask = torch.ones_like(x[0, [0]])  # (1, H, W)


Why x[0, [0]]? Could we do something more readable?

Good shout. This is what I normally do (see revision). Is that more readable for you? Otherwise, I would need to call torch.ones and pass the dtype and device explicitly.
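To make the trade-off in this exchange concrete, here is a small comparison of ways to build a `(1, H, W)` tensor of ones matching `x`. The input tensor is hypothetical, and the revision mentioned above is not shown here, so this is not a claim about which spelling was adopted:

```python
import torch

# Hypothetical (B, C, H, W) feature map, used only to compare the spellings.
x = torch.zeros(2, 3, 4, 5, dtype=torch.float64)

# List index keeps the channel dimension: result is (1, H, W).
a = torch.ones_like(x[0, [0]])

# Slice spelling, same shape, dtype, and device as above.
b = torch.ones_like(x[0, :1])

# Explicit construction: dtype and device must be passed by hand.
c = torch.ones((1, *x.shape[2:]), dtype=x.dtype, device=x.device)

# An integer index drops the dimension instead: result is (H, W).
d = torch.ones_like(x[0, 0])
```

The `ones_like` variants inherit dtype and device from `x`, which is the convenience the reply refers to.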

Cadene · 2024-04-08T12:41:59Z

    dataloader = torch.utils.data.DataLoader(
        dataset,
-        num_workers=4,
+        num_workers=0,


Remove before merging, no?

Yep sorry. Btw IMO this goes in config.
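One way to read "this goes in config" is to lift the hard-coded `num_workers` into a small config object and pass it through as keyword arguments. The sketch below is only an illustration: `DataLoaderConfig`, its fields other than `num_workers`, and `dataloader_kwargs` are hypothetical names, not part of lerobot.

```python
from dataclasses import asdict, dataclass


@dataclass
class DataLoaderConfig:
    """Hypothetical config holder; defaults are placeholders, not the PR's values."""

    num_workers: int = 4
    batch_size: int = 8
    shuffle: bool = True


def dataloader_kwargs(cfg: DataLoaderConfig) -> dict:
    """Turn the config into keyword arguments for a DataLoader-style constructor."""
    return asdict(cfg)


# A debugging run can override the value without editing the training script.
debug_cfg = DataLoaderConfig(num_workers=0)
kwargs = dataloader_kwargs(debug_cfg)
```

The resulting dictionary could then be unpacked as `torch.utils.data.DataLoader(dataset, **kwargs)`, since `num_workers`, `batch_size`, and `shuffle` are all existing `DataLoader` parameters.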

alexander-soare · 2024-04-08T13:46:07Z

@Cadene many thanks for the review. Bty


Cadene

Thanks for this PR :)
I pushed some changes to user/rcadene/2024_03_31_remove_torchrl.
In particular, test_policies.py now passes with Aloha/Act.
You will need to solve some non-trivial merge issues.
Don't hesitate to call me so that we can solve them together.
Thanks!


Refactor act

alexander-soare added 16 commits April 2, 2024 19:11

backup wip

2b928ee

backup wip

65ef8c3

Merge remote-tracking branch 'upstream/main' into refactor_act

c7d70a8

backup wip

110ac5f

backup wip

278336a

backup wip

3a4dfa8

backup wip

edb125b

Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl

9d77f57

Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl

4863e54

Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl

8ba88ba

Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl

0b8d27f

re-add pre-commit check

9c28ac8

backup wip

1e71196

Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl

ab22860

Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl

ecc7dd3

backup wip

8d2463f

alexander-soare marked this pull request as draft April 8, 2024 08:02

alexander-soare added 3 commits April 8, 2024 09:25

Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl

e982c73

Eval reproduction works with gym_aloha

1bab4a1

ready for review

863f28f

alexander-soare marked this pull request as ready for review April 8, 2024 12:16

alexander-soare changed the title ~~[WIP] Refactor act~~ Refactor act Apr 8, 2024

alexander-soare changed the base branch from main to user/rcadene/2024_03_31_remove_torchrl April 8, 2024 12:18

empty commit

0a721f3

alexander-soare requested a review from Cadene April 8, 2024 12:29