[optim] Fix bug when default dtype is BF16 by gau-nernst · Pull Request #2286 · pytorch/ao

gau-nernst · 2025-05-31T05:56:08Z

Happened in #2235

When the default dtype is changed with torch.set_default_dtype(torch.bfloat16), qmap = torch.tensor(...) is created in BF16. Dequantizing the low-bit tensor subclass therefore yields BF16, which leads to a dtype mismatch in the Adam logic.
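A minimal sketch of the failure mode, for illustration only. The helper `make_qmap`, the sample values, and the lookup-by-index dequantization are assumptions made for this example and are not taken from the torchao source; pinning the dtype explicitly is one plausible remedy, not necessarily the exact change made in this PR.

```python
import torch


def make_qmap(values, dtype=None):
    # Hypothetical helper: with dtype=None, torch.tensor() infers the dtype
    # of Python floats from the current global default dtype.
    return torch.tensor(values, dtype=dtype)


prev = torch.get_default_dtype()
torch.set_default_dtype(torch.bfloat16)
try:
    values = [-1.0, -0.5, 0.0, 0.5, 1.0]  # illustrative lookup table
    codes = torch.tensor([0, 2, 4])       # integer codes index into the table

    qmap_implicit = make_qmap(values)                       # follows the default dtype
    qmap_explicit = make_qmap(values, dtype=torch.float32)  # pinned to FP32

    # Dequantization modeled as a table lookup: the result inherits qmap's dtype.
    deq_implicit = qmap_implicit[codes]
    deq_explicit = qmap_explicit[codes]
finally:
    # Restore the global default so later code is unaffected.
    torch.set_default_dtype(prev)

print(deq_implicit.dtype)  # torch.bfloat16
print(deq_explicit.dtype)  # torch.float32
```

With the implicit version, any downstream optimizer math that expects FP32 state would receive BF16 instead, which matches the mismatch described above.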

pytorch-bot · 2025-05-31T05:56:11Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2286

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There is 1 currently active SEV. If your PR is affected, please view it below:

Experiencing "429: Too Many Requests" on downloading actions

⏳ No Failures, 1 Pending

As of commit e658764 with merge base bc68b11:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

gau-nernst · 2025-06-04T03:41:17Z

@msaroufim Can I get a stamp on this? Thank you!

* handle error when default dtype is BF16
* skip FP8 optim on unsupported GPUs

facebook-github-bot added the "CLA Signed" label May 31, 2025

gau-nernst added the "topic: bug fix" label May 31, 2025

gau-nernst requested review from HDCharles and msaroufim May 31, 2025 05:58

gau-nernst added 2 commits June 4, 2025 10:04

handle error when default dtype is BF16

c95947a

skip FP8 optim on unsupported GPUs

e658764

gau-nernst force-pushed the optim/default_dtype branch from c87f106 to e658764 June 4, 2025 02:08

msaroufim approved these changes Jun 4, 2025

msaroufim merged commit 3aa9361 into pytorch:main Jun 4, 2025
19 checks passed

gau-nernst deleted the optim/default_dtype branch June 4, 2025 03:53

liangel-02 pushed a commit that referenced this pull request Aug 25, 2025

[optim] Fix bug when default dtype is BF16 (#2286)

a5276e5

* handle error when default dtype is BF16
* skip FP8 optim on unsupported GPUs
