docs(Float8DynamicActivationFloat8WeightConfig): mark as inference-only (#4376) by Anai-Guo · Pull Request #4451 · pytorch/ao

Anai-Guo · 2026-05-29T02:19:41Z

Summary

Closes #4376.

Adds an explicit "Note: inference-only" block to the docstring of Float8DynamicActivationFloat8WeightConfig. Users who pick up this config for LoRA on a quantized base (or any other training workflow) now learn up front that the backward pass raises RuntimeError: derivative for aten::_scaled_mm is not implemented, instead of having to debug it from a crash.

This is option (1) from the issue — the lowest-risk path. The training-aware variant in option (2) is a separate, larger change.

What changes

torchao/quantization/quant_api.py — one docstring-only block added to Float8DynamicActivationFloat8WeightConfig. Mirrors the existing Note: block style on Float8WeightOnlyConfig a few lines above.

The note:

Names the actual exception users see (derivative for aten::_scaled_mm is not implemented).
Explains the root cause in one line (torch._scaled_mm has no autograd kernel).
Points users at the LoRA-skip filter_fn pattern they almost certainly want next.
Points users at torchao.float8.convert_to_float8_training for the actual float8 training path.
Links back to #4376 ("Float8DynamicActivationFloat8WeightConfig is inference-only: backward fails on torch._scaled_mm; document or add training-aware variant") so future readers can find the original discussion.
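
For readers unfamiliar with the LoRA-skip filter_fn pattern mentioned above, a minimal sketch follows. It is not taken from this PR or from the torchao docs. The helper name make_skip_lora_filter and the PEFT-style adapter names lora_A / lora_B are assumptions made for illustration. The only torchao behavior it relies on is that quantize_ accepts a filter_fn(module, fqn) -> bool callback. It shows how to keep adapter modules out of quantization and makes no claim about backward through the quantized base layers.

```python
from typing import Any, Callable, Tuple

# A filter receives a module and its fully qualified name (FQN) and
# returns True if the module should be quantized.
FilterFn = Callable[[Any, str], bool]

# Assumption: adapters follow PEFT-style naming, so their FQNs contain a
# "lora_A" or "lora_B" path component. Adjust for other naming schemes.
LORA_MARKERS: Tuple[str, ...] = ("lora_A", "lora_B")


def make_skip_lora_filter(
    base_filter: FilterFn,
    markers: Tuple[str, ...] = LORA_MARKERS,
) -> FilterFn:
    """Wrap base_filter so that LoRA adapter modules are never selected."""

    def filter_fn(module: Any, fqn: str) -> bool:
        # Match whole path components to avoid accidental substring hits.
        if any(part in markers for part in fqn.split(".")):
            return False  # leave trainable adapters unquantized
        return base_filter(module, fqn)

    return filter_fn


# Hypothetical usage, assuming torch and torchao are installed and `model`
# is an nn.Module with LoRA adapters already attached:
#
#   import torch
#   from torchao.quantization import (
#       Float8DynamicActivationFloat8WeightConfig,
#       quantize_,
#   )
#
#   is_linear = lambda m, fqn: isinstance(m, torch.nn.Linear)
#   quantize_(
#       model,
#       Float8DynamicActivationFloat8WeightConfig(),
#       filter_fn=make_skip_lora_filter(is_linear),
#   )
```

The predicate is kept free of torch imports so it can be unit-tested on FQN strings alone; the base filter carries the actual module type check.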

No code changes — purely a docstring addition that flows through to the generated API docs.

🤖 Generated with Claude Code

pytorch-bot · 2026-05-29T02:19:44Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/4451

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

/easycla is not responding

This comment was automatically generated by Dr. CI and updates every 15 minutes.

docs(Float8DynamicActivationFloat8WeightConfig): mark as inference-on…

d8609df

…ly (pytorch#4376)

Anai-Guo requested review from andrewor14 and jerryzh168 as code owners May 29, 2026 02:19

meta-cla Bot added the CLA Signed label May 29, 2026

Anai-Guo wants to merge 1 commit into pytorch:main from Anai-Guo:docs-float8-dyn-act-inference-only-4376
