add learnable_fake_quantize in pt2e #3135

navsud · 2025-10-09T03:18:35Z

Summary: Learnable fake quantize is a popular technique, used especially for low-precision QAT. It is available in FX quantization at https://github.com/pytorch/pytorch/blob/main/torch/ao/quantization/_learnable_fake_quantize.py#L10?, but not in the pt2e quantization flow. This change adds learnable fake quantize to pt2e, with some changes, mainly in how channel-wise quantization works.

Differential Revision: D83542550
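
For readers unfamiliar with the technique, here is a minimal per-tensor sketch of what "learnable" means: the scale and zero point get gradients and are updated by the optimizer along with the weights. It is not the code in this PR. It is plain Python with hand-derived straight-through gradients, whereas a real implementation would presumably rely on torch autograd. All function names are hypothetical, the signed 4-bit range is an assumption, and the zero point is kept real-valued for simplicity.

```python
# Illustrative sketch only; not the torchao or PyTorch implementation.
# All function names here are hypothetical.

QMIN, QMAX = -8, 7  # assumed signed 4-bit integer range


def fake_quantize(x, scale, zero_point):
    """Quantize one float to the integer grid, then dequantize it."""
    q = round(x / scale + zero_point)
    q = max(QMIN, min(QMAX, q))  # clamp to the representable range
    return (q - zero_point) * scale


def param_grads(x, scale, zero_point):
    """Gradients of fake_quantize w.r.t. (scale, zero_point).

    round() is treated as the identity in the backward pass
    (straight-through estimator).
    """
    q = round(x / scale + zero_point)
    if q < QMIN:  # clamped low: output is (QMIN - zero_point) * scale
        return QMIN - zero_point, -scale
    if q > QMAX:  # clamped high: output is (QMAX - zero_point) * scale
        return QMAX - zero_point, -scale
    # In range: only the rounding error depends on scale.
    return (fake_quantize(x, scale, zero_point) - x) / scale, 0.0


def mse(xs, scale, zero_point):
    """Mean squared error between the inputs and their fake-quantized values."""
    return sum((fake_quantize(x, scale, zero_point) - x) ** 2 for x in xs) / len(xs)


def sgd_step(xs, scale, zero_point, lr):
    """One plain SGD step on (scale, zero_point), minimizing mse()."""
    g_scale = g_zp = 0.0
    for x in xs:
        err = fake_quantize(x, scale, zero_point) - x
        d_scale, d_zp = param_grads(x, scale, zero_point)
        g_scale += 2.0 * err * d_scale / len(xs)
        g_zp += 2.0 * err * d_zp / len(xs)
    return scale - lr * g_scale, zero_point - lr * g_zp


if __name__ == "__main__":
    data = [-2.0, -0.5, 0.3, 1.1, 3.0]  # several values fall outside the initial range
    scale, zero_point = 0.1, 0.0
    print("before:", scale, zero_point, mse(data, scale, zero_point))
    scale, zero_point = sgd_step(data, scale, zero_point, lr=0.01)
    print("after: ", scale, zero_point, mse(data, scale, zero_point))
```

With inputs that fall outside the initial clamping range, a single step moves both parameters and lowers the reconstruction error. A per-channel variant would keep one (scale, zero_point) pair per channel and apply the same update to each channel independently.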

pytorch-bot · 2025-10-09T03:18:38Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3135

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 4 New Failures

As of commit dd004b2 with merge base a5a8fe2:

NEW FAILURES - The following jobs have failed:

Run Regression Tests / test (CPU 2.6, linux.4xlarge, torch==2.6.0 --index-url https://download.pytorch.org/whl/cpu, cpu) / linux-job (gh)
test/quantization/pt2e/test_learnable_fake_quantize.py::TestLearnableFakeQuantizeIntegration::test_optimizer_updates_scale_and_zero_point
Run Regression Tests / test (CPU 2.7, linux.4xlarge, torch==2.7.0 --index-url https://download.pytorch.org/whl/cpu, cpu) / linux-job (gh)
test/quantization/pt2e/test_learnable_fake_quantize.py::TestLearnableFakeQuantizeIntegration::test_optimizer_updates_scale_and_zero_point
Run Regression Tests / test (CPU 2.8, linux.4xlarge, torch==2.8.0 --index-url https://download.pytorch.org/whl/cpu, cpu) / linux-job (gh)
test/quantization/pt2e/test_learnable_fake_quantize.py::TestLearnableFakeQuantizeIntegration::test_optimizer_updates_scale_and_zero_point
Run Regression Tests / test-nightly (CPU Nightly, linux.4xlarge, --pre torch --index-url https://download.pytorch.org/wh... / linux-job (gh)
test/quantization/pt2e/test_learnable_fake_quantize.py::TestLearnableFakeQuantizeIntegration::test_optimizer_updates_scale_and_zero_point

This comment was automatically generated by Dr. CI and updates every 15 minutes.

meta-codesync · 2025-10-09T03:18:42Z

@navsud has exported this pull request. If you are a Meta employee, you can view the originating Diff in D83542550.

jerryzh168 · 2025-10-09T03:34:12Z

torchao/quantization/pt2e/learnable_fake_quantize.py

can you correct the name to fake_quantizer?

@jerryzh168
I'm trying to be consistent with the FakeQuantize() class at https://github.com/pytorch/ao/blob/main/torchao/quantization/pt2e/fake_quantize.py.
I'm OK with changing the file name from learnable_fake_quantize.py to learnable_fake_quantizer.py and the class name from LearnableFakeQuantize() to LearnableFakeQuantizer(), if that is what you meant.

Yeah this is for pt2e flow, so I feel the name FakeQuantize is more consistent with the other classes?

I think FakeQuantize is the wrong name, since it's a verb rather than a noun, which makes it inconsistent with Observer (a noun). We can correct this, and deprecate FakeQuantize itself, as we add new things.

But it's OK to do this separately as well

andrewor14

Looks great, thanks for addressing the comments. Once you fix the tests we can go ahead and merge this

andrewor14 · 2025-10-09T14:23:54Z

torchao/quantization/pt2e/learnable_fake_quantize.py

Yeah this is for pt2e flow, so I feel the name FakeQuantize is more consistent with the other classes?

Summary: Learnable Fake Quantize is a popular technique used especially for low-precision QAT. This was available in fx quantization at https://github.com/pytorch/pytorch/blob/main/torch/ao/quantization/_learnable_fake_quantize.py#L10?, but not in pt2e quantization flow. This change adds learnable fake quantize in pt2e, with some changes, especially related to how channel wise quantization works. Reviewed By: andrewor14 Differential Revision: D83542550

meta-cla bot added the CLA Signed label Oct 9, 2025

meta-codesync bot added the fb-exported and meta-exported labels Oct 9, 2025

navsud added the topic: new feature label Oct 9, 2025

jerryzh168 requested a review from andrewor14 October 9, 2025 03:32

jerryzh168 reviewed Oct 9, 2025

navsud force-pushed the export-D83542550 branch from 38017fc to 60b321a Compare October 9, 2025 03:36

navsud force-pushed the export-D83542550 branch from 60b321a to a2389fc Compare October 9, 2025 03:55

andrewor14 approved these changes Oct 9, 2025

navsud force-pushed the export-D83542550 branch from a2389fc to 149bc0b Compare October 9, 2025 17:54

navsud force-pushed the export-D83542550 branch from 149bc0b to dd004b2 Compare October 9, 2025 18:10

meta-codesync bot merged commit 233cfc1 into pytorch:main Oct 9, 2025
15 of 20 checks passed
