feat: add Gemma/Gemma2 training chat templates with generation markers by ps-abhi · Pull Request #5523 · huggingface/trl

ps-abhi · 2026-04-11T17:27:11Z

What does this PR do?

Adds {% generation %} markers for Gemma/Gemma2 training chat templates. Part of #5471.

gemma.jinja: unmodified reference template (from google/gemma-7b-it).
gemma_training.jinja: training variant. Split the unified output line into role-specific branches so <start_of_turn>model\n stays outside the generation block and assistant content + <end_of_turn>\n goes inside.
Modified get_training_chat_template to return the training variant.
Gemma and Gemma2 ship identical chat templates, so one file + one branch covers both.
New TestGetTrainingChatTemplateGemma class, parametrized over both tiny fixtures, asserting text equivalence vs the original and mask correctness across multi-turn conversations.

One extra change worth flagging: reordered the and in the no-patching-needed check so the substring check runs before is_chat_template_prefix_preserving. Gemma's template raises TemplateError on tool-role probes, and without the reorder the Gemma branch is unreachable. Happy to move this to a separate PR if preferred.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline, Pull Request section?
Was this discussed/approved via a GitHub issue? Please add a link to it if that's the case. — Tracking: Add {% generation %} chat templates for common model families #5471
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

AI writing disclosure

No AI usage: the PR was written entirely by a human.
AI-assisted: some parts were suggested or improved by AI, but the PR was written and reviewed by a human.
AI-generated: the PR was mostly or fully generated by an AI tool.

Who can review?

@qgallouedec (created #5471)

Note

Medium Risk
Medium risk because it changes chat-template selection logic and tool-calling detection fallbacks, which can affect how prompts/masks are generated during training across multiple model families.

Overview
Adds Gemma/Gemma2 support to TRL’s chat-template patching by introducing a reference gemma.jinja and a new gemma_training.jinja that wraps only the assistant-generated portion in {% generation %} markers (keeping <start_of_turn>model\n outside) for correct assistant_only_loss masking.

Updates get_training_chat_template() to recognize Gemma templates and to skip prefix-preservation checks for templates that don’t support tool messages, and hardens supports_tool_calling() with a TypeError fallback for templates that reject dict tool arguments (notably DeepSeek-V3). Tests and docs are adjusted accordingly (new Gemma/Gemma2 fixtures in TestGetTrainingChatTemplate, skips for tool-less templates, and removal of the DeepSeek tool-calling xfail).

^{Reviewed by Cursor Bugbot for commit 1dad468. Bugbot is set up for automated code reviews on this repo. Configure here.}

HuggingFaceDocBuilderDev · 2026-04-22T16:50:58Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{❌ Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

^{Reviewed by Cursor Bugbot for commit b4e5485. Configure here.}

qgallouedec

thanks!

ps-abhi and others added 5 commits April 11, 2026 22:34

feat: add Gemma/Gemma2 training chat templates with generation markers

f1a87a5

Merge branch 'main' into gemma-training-template

6e3128e

Merge branch 'main' into gemma-training-template

57c9f59

Merge branch 'main' into gemma-training-template

553ec27

Merge branch 'main' into gemma-training-template

ce53bb4

qgallouedec mentioned this pull request Apr 22, 2026

Tracking: Add {% generation %} chat templates for common model families #5471

Open

24 tasks

Merge branch 'main' into gemma-training-template

a7e8da5

cursor Bot reviewed Apr 22, 2026

View reviewed changes

Comment thread tests/test_chat_template_utils.py Outdated

various fixes

ba722ae

nit

b4e5485

cursor Bot reviewed Apr 22, 2026

View reviewed changes

Comment thread trl/chat_template_utils.py

qgallouedec and others added 3 commits April 22, 2026 17:09

no xfail anymore

b6ec569

Merge branch 'main' into gemma-training-template

b9fe16a

style

1dad468

qgallouedec approved these changes Apr 22, 2026

View reviewed changes

qgallouedec merged commit 3256995 into huggingface:main Apr 22, 2026
1 check passed

hwanython mentioned this pull request Apr 30, 2026

Add Gemma 3 training chat template #5685

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add Gemma/Gemma2 training chat templates with generation markers#5523

feat: add Gemma/Gemma2 training chat templates with generation markers#5523
qgallouedec merged 11 commits into
huggingface:mainfrom
ps-abhi:gemma-training-template

ps-abhi commented Apr 11, 2026 •

edited by cursor Bot

Loading

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Apr 22, 2026

Uh oh!

cursor Bot left a comment

Uh oh!

Uh oh!

qgallouedec left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

ps-abhi commented Apr 11, 2026 • edited by cursor Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Before submitting

AI writing disclosure

Who can review?

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented Apr 22, 2026

Uh oh!

cursor Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

qgallouedec left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ps-abhi commented Apr 11, 2026 •

edited by cursor Bot

Loading