Add algorithm search to FP8 sparse linear benchmark (#4397)

Open

bbeckca wants to merge 1 commit into pytorch:main from bbeckca:export-D102683062
Conversation

@bbeckca
Contributor

@bbeckca bbeckca commented May 13, 2026

Summary:

What: Adds algorithm search support (--search-alg) to the FP8 sparse linear benchmark. Threads a new alg_id parameter through the quantization config (Float8DynamicActivationFloat8WeightConfig), the sparse tensor class, and down into the _cslt_sparse_mm kernel call. When --search-alg is passed, the benchmark calls _cslt_sparse_mm_search to find the best algorithm for each shape and benchmarks with that algorithm to report the speedup.

Why: hipSPARSELt supports multiple algorithms for sparse matmul, and the default (alg_id=0) isn't always the fastest. This lets us find the best algorithm for a given shape and measure the performance benefit.

Differential Revision: D102683062
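The search-then-benchmark flow described in the summary can be sketched as below. Note this is a hypothetical illustration of the control flow only: `_cslt_sparse_mm` and `_cslt_sparse_mm_search` are the PyTorch ops named in the PR (they require a CUDA/ROCm build with cuSPARSELt/hipSPARSELt and real sparse tensors), so they are replaced here with stand-in stubs, and the fake latencies are invented for the example.

```python
# Sketch of the --search-alg flow from the PR summary. The two underscored
# functions stand in for torch._cslt_sparse_mm_search / torch._cslt_sparse_mm;
# their bodies here are stubs so the control flow runs without a GPU.

def _cslt_sparse_mm_search(shape):
    # Stub: pretend algorithm 3 is fastest for every shape.
    return 3

def _cslt_sparse_mm(shape, alg_id=0):
    # Stub: return a fake latency in ms; the searched alg is faster here.
    return 1.0 if alg_id == 0 else 0.7

def benchmark(shape, search_alg=False):
    """Benchmark one shape, optionally searching for the best algorithm.

    Mirrors the PR's behavior: with search_alg, find the best alg_id for
    this shape first, then benchmark the matmul with that alg_id.
    """
    alg_id = _cslt_sparse_mm_search(shape) if search_alg else 0
    latency_ms = _cslt_sparse_mm(shape, alg_id=alg_id)
    return alg_id, latency_ms

if __name__ == "__main__":
    shape = (4096, 4096)
    _, baseline_ms = benchmark(shape)
    best_alg, best_ms = benchmark(shape, search_alg=True)
    print(f"best alg_id={best_alg}, speedup={baseline_ms / best_ms:.2f}x")
```

With the stub latencies above, the searched algorithm reports a 1.43x speedup over the default `alg_id=0`, which is the kind of per-shape comparison the benchmark prints.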

@pytorch-bot

pytorch-bot Bot commented May 13, 2026

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/4397

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEV

There is 1 currently active SEV. If your PR is affected, please view it below:

❌ 2 New Failures

As of commit 926a428 with merge base 13cd013:

NEW FAILURES - The following jobs have failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label May 13, 2026
@meta-codesync

meta-codesync Bot commented May 13, 2026

@bbeckca has exported this pull request. If you are a Meta employee, you can view the originating Diff in D102683062.

@bbeckca
Contributor Author

bbeckca commented May 14, 2026

@pytorchbot label "module: rocm"

@meta-codesync meta-codesync Bot changed the title Add algorithm search to FP8 sparse linear benchmark Add algorithm search to FP8 sparse linear benchmark (#4397) May 14, 2026
@bbeckca bbeckca force-pushed the export-D102683062 branch from 26fe93b to b918f9b Compare May 14, 2026 18:05
bbeckca added a commit to bbeckca/ao that referenced this pull request May 14, 2026
(Commit message duplicates the PR summary above.)
bbeckca added a commit to bbeckca/ao that referenced this pull request May 14, 2026
(Commit message duplicates the PR summary above.)
@bbeckca bbeckca force-pushed the export-D102683062 branch from b918f9b to 321ecbb Compare May 14, 2026 18:15
(Commit message duplicates the PR summary above.)
@bbeckca bbeckca force-pushed the export-D102683062 branch from 321ecbb to 926a428 Compare May 15, 2026 20:37
Labels

CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. fb-exported meta-exported module: rocm

Projects

None yet

1 participant