Add test and docs for multimodal tool responses by qgallouedec · Pull Request #5448 · huggingface/trl

qgallouedec · 2026-04-03T16:58:14Z

Add test test_training_with_tools_multimodal_response covering tools that return images introduced in Support multimodal tool responses in environment_factory for VLM training #5323
Add "Multimodal Tool Responses" subsection to the Agent Training docs with a code example

Note

Low Risk
Low risk: changes are limited to documentation and an additional unit test covering image-returning tool outputs; no production training logic is modified.

Overview
Adds documentation for multimodal tool outputs in GRPO agent training, showing that tools can return a list of {type: image/text} blocks and that images are injected into subsequent VLM turns.

Adds a new vision-gated GRPO trainer test (test_training_with_tools_multimodal_response) that mocks generation to trigger tool calls and verifies multimodal tool responses (PIL image + text) train successfully and log expected tool call/failure frequencies.

^{Reviewed by Cursor Bugbot for commit c40f4f5. Bugbot is set up for automated code reviews on this repo. Configure here.}

HuggingFaceDocBuilderDev · 2026-04-03T17:00:53Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

sergiopaniego

thanks!!

Add test and docs for multimodal tool responses

519ec2d

qgallouedec requested review from AmineDiro, albertvillanova, kashif and sergiopaniego April 3, 2026 16:58

Merge branch 'main' into multimodal-tool-responses-doc-and-test

d29e518

qgallouedec and others added 2 commits April 3, 2026 13:42

Merge branch 'main' into multimodal-tool-responses-doc-and-test

a8ae419

Merge branch 'main' into multimodal-tool-responses-doc-and-test

c40f4f5

sergiopaniego approved these changes Apr 6, 2026

View reviewed changes

qgallouedec merged commit 5c22894 into main Apr 6, 2026
14 checks passed

qgallouedec deleted the multimodal-tool-responses-doc-and-test branch April 6, 2026 13:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add test and docs for multimodal tool responses#5448

Add test and docs for multimodal tool responses#5448
qgallouedec merged 4 commits into
mainfrom
multimodal-tool-responses-doc-and-test

qgallouedec commented Apr 3, 2026 •

edited by cursor Bot

Loading

Uh oh!

HuggingFaceDocBuilderDev commented Apr 3, 2026

Uh oh!

sergiopaniego left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

qgallouedec commented Apr 3, 2026 • edited by cursor Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Apr 3, 2026

Uh oh!

sergiopaniego left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

qgallouedec commented Apr 3, 2026 •

edited by cursor Bot

Loading