Fix SpatialSoftmax input shape by alexander-soare · Pull Request #150 · huggingface/lerobot

alexander-soare · 2024-05-08T12:45:00Z

What this does

The input shape for SpatialSoftmax should be the crop shape, not the policy's input shape. This worked before only by a lucky coincidence: downsampling happened to produce a feature map of the same size for the cropped and non-cropped versions. That would not hold for larger input images.
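To illustrate how such a coincidence can arise, here is a minimal sketch that applies the standard convolution output-size formula, floor((n + 2p - k) / s) + 1, through a stack of stride-2 stages. The stage list (a ResNet-18-style layout) and the image sizes 96/84 and 224/192 are assumptions chosen for illustration; they are not taken from this PR.

```python
def conv_out(n: int, kernel: int, stride: int, padding: int) -> int:
    """Spatial output size of a conv/pool layer along one dimension."""
    return (n + 2 * padding - kernel) // stride + 1


# Hypothetical (kernel, stride, padding) for each downsampling stage,
# laid out like a ResNet-18 backbone. This is an assumption, not the PR's config.
STAGES = [(7, 2, 3), (3, 2, 1), (3, 2, 1), (3, 2, 1), (3, 2, 1)]


def feature_map_side(n: int, stages=STAGES) -> int:
    """Push one spatial dimension through every downsampling stage."""
    for kernel, stride, padding in stages:
        n = conv_out(n, kernel, stride, padding)
    return n


# Small images: full and cropped inputs happen to give the same feature map side.
small_full, small_crop = feature_map_side(96), feature_map_side(84)

# Larger images: the two sides differ, so the shapes no longer match.
large_full, large_crop = feature_map_side(224), feature_map_side(192)

print(small_full, small_crop, large_full, large_crop)
```

Under these assumed sizes, rounding in the integer division hides the difference between the full and cropped inputs for the small image but not for the larger one.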

Cadene

Left a comment that you could address before merging.

Cadene · 2024-05-08T13:19:18Z

            feat_map_shape = tuple(
-                self.backbone(torch.zeros(size=(1, *config.input_shapes["observation.image"]))).shape[1:]
+                self.backbone(
+                    torch.zeros(size=(1, config.input_shapes["observation.image"][0], *config.crop_shape))
+                ).shape[1:]
            )


Thanks! Just a few details on this line. It's difficult to understand.

Would it be better to break it into a few lines?

1. Create the input
2. Run the forward pass to get the output
3. Get the shape

Also, could you explain why we take the first dimension, [0]?

config.input_shapes["observation.image"][0]

And why do we use size=? It's quite unusual.

Finally, I am wondering why we don't create our torch.zeros tensor with the device argument.

alexander-soare · 2024-05-08

I broke it down. To answer your questions:

> And why do we use size=? It's quite unusual.

I just like being explicit with this argument, because in NumPy the shape is sometimes not the first argument. It's a habit of mine.

> Finally, I am wondering why we don't create our torch.zeros tensor with the device argument.

The device is not specified at this point, so we default to CPU.
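For reference, the three steps requested in the review (create the input, run the forward pass, get the shape) might be sketched as follows. This is not the merged code: the tiny `backbone`, the `input_shapes` dictionary, and the `crop_shape` value are stand-ins assumed for illustration, with image shapes assumed to be in (channels, height, width) order so that index `[0]` is the channel count.

```python
import torch
from torch import nn

# Hypothetical stand-ins for the policy config and backbone (assumptions).
input_shapes = {"observation.image": (3, 96, 96)}  # assumed (C, H, W)
crop_shape = (84, 84)                              # assumed (H, W) after cropping
backbone = nn.Sequential(
    nn.Conv2d(3, 8, kernel_size=3, stride=2, padding=1),
    nn.ReLU(),
)

# 1. Create a dummy input: batch of 1, the image's channel count ([0]),
#    and the crop's spatial size. No device is given, so it lives on the CPU.
num_channels = input_shapes["observation.image"][0]
dummy_input = torch.zeros(size=(1, num_channels, *crop_shape))

# 2. Run a forward pass through the backbone to get a dummy feature map.
with torch.inference_mode():
    dummy_feature_map = backbone(dummy_input)

# 3. Drop the batch dimension to get the (C, H, W) feature map shape.
feat_map_shape = tuple(dummy_feature_map.shape[1:])
print(feat_map_shape)
```

Only the channel count comes from the policy's input shape; the height and width come from the crop, which is the point of the fix.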

Fix SpatialSoftmax input shape

14ccb9a

Cadene approved these changes May 8, 2024


revision

393c132

alexander-soare merged commit f5de57b into huggingface:main May 8, 2024

alexander-soare deleted the fix_spatial_softmax_input_shape branch May 8, 2024 13:57

menhguin pushed a commit to menhguin/lerobot that referenced this pull request Feb 9, 2025

Fix SpatialSoftmax input shape (huggingface#150)

220499d

Kalcy-U referenced this pull request in Kalcy-U/lerobot May 13, 2025

Fix SpatialSoftmax input shape (#150)

f209c48

ZoreAnuj pushed a commit to luckyrobots/lerobot that referenced this pull request Jul 29, 2025

Fix SpatialSoftmax input shape (huggingface#150)

1337693

alexander-soare merged 2 commits into huggingface:main from alexander-soare:fix_spatial_softmax_input_shape
