Image resizing and point coordinates

Hi!

I am super interested in the pointing functionality but haven't seen anyone asked about this detail:

**When you resize the images in the preprocessor, do you also rescale the "_point coordinates_" accordingly?** It feels to be the right way but from the fact that the code handles the formatter before resizing the images: 
first do data formatting
https://github.com/allenai/molmo/blob/793fa387edfd6fd0f5b21eb8e0a7620a1f3799e1/olmo/data/model_preprocessor.py#L836
then call multimodal processor
https://github.com/allenai/molmo/blob/793fa387edfd6fd0f5b21eb8e0a7620a1f3799e1/olmo/data/model_preprocessor.py#L841

 (I might certainly miss some details!!), looks like the points' coordinates are kept to its original value and serialized into texts.
Can you share some more insights on this? Thanks a lot!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Image resizing and point coordinates #52

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Image resizing and point coordinates #52

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions