
Conversation

@njriasan
Contributor

In an example kernel we saw a regression arising from a change in how the SWP schedule is generated in the presence of a jagged tensor bias. The exact cause of the change is unclear (it could have been schedule changes or op backtracking that enabled picking the op), but in either case the ideal solution is to disable SWP for the bias loads.

To do this we want to modify the kernel to set latency=0 on the bias loads, while still letting the compiler derive the latencies of all other loads. This PR accomplishes that with the following process:

  1. When loads are annotated by the user, we only "skip" the automatic latency assignment if any of the annotated latencies are non-zero.
  2. On the MMA side, we omit updating anything but the op's own latency if the op was annotated.
  3. On the load side, we remove an annotated load from the loadOpToIndLevel calculation. This ensures both that "distance" calculations omit the load and that the load's latency is not modified. (A sketch of these changes follows this list.)
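To make the mechanics concrete, here is a minimal sketch of what the pass-side checks could look like. It assumes Triton's `tt.latency` attribute and the `loadOpToIndLevel` map named above; the helper names `hasNonZeroUserLatency` and `computeLoadOpToIndLevel` are hypothetical, and this is a sketch of the idea, not the PR's actual diff.

```cpp
#include "mlir/Dialect/SCF/IR/SCF.h"
#include "mlir/IR/BuiltinAttributes.h"
#include "llvm/ADT/MapVector.h"

using namespace mlir;

// Hypothetical: the pass's existing "distance" computation, which maps
// each pipelineable load to its indirection level.
llvm::MapVector<Operation *, int> computeLoadOpToIndLevel(scf::ForOp forOp);

// Hypothetical helper: true if any op in the loop carries a non-zero
// user-provided "tt.latency" annotation.
static bool hasNonZeroUserLatency(scf::ForOp forOp) {
  bool found = false;
  forOp.walk([&](Operation *op) {
    if (auto latency = op->getAttrOfType<IntegerAttr>("tt.latency"))
      if (latency.getInt() != 0)
        found = true;
  });
  return found;
}

void assignLatenciesSketch(scf::ForOp forOp, int numStages) {
  // (1) Only skip automatic assignment when some annotation is non-zero;
  // latency=0 annotations alone still let the compiler derive latencies
  // for the remaining, unannotated loads.
  if (hasNonZeroUserLatency(forOp))
    return;

  llvm::MapVector<Operation *, int> loadOpToIndLevel =
      computeLoadOpToIndLevel(forOp);

  // (3) Drop annotated loads so they neither receive a derived latency
  // nor stretch the longest load path used by the distance calculation.
  loadOpToIndLevel.remove_if(
      [](const auto &kv) { return kv.first->hasAttr("tt.latency"); });

  // ... derive latencies for the remaining loads as before ...
}
```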

To explain the issue in slightly more detail: the latency that is assigned is based on the longest load path. The jagged tensor bias made the longest path 2, so every load was now pipelined with num_stages / 2. This led to a regression because the other loads needed more pipelining (and the bias did not need pipelining at all). By setting the latency of the loads on that path to 0, you retain the original schedule, where all loads are pipelined with the full num_stages.
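As a back-of-the-envelope illustration of that arithmetic (the formula here, num_stages divided by the longest load path, is an assumed model of the latency assignment, not a quote of the implementation):

```cpp
#include <cstdio>

int main() {
  int numStages = 4;

  // Original schedule: longest load path is 1, so each load gets the
  // full pipelining budget.
  int latencyBefore = numStages / 1; // 4

  // With the jagged bias in the distance calculation the longest path
  // becomes 2, halving the latency assigned to *every* load.
  int latencyWithBias = numStages / 2; // 2

  // Annotating the bias loads with latency=0 removes them from the path,
  // restoring the full budget for the loads that actually need it.
  std::printf("before=%d with bias=%d after annotation=%d\n",
              latencyBefore, latencyWithBias, numStages / 1);
  return 0;
}
```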

@meta-cla meta-cla bot added the CLA Signed label on Oct 30, 2025
@njriasan
Contributor Author

I've started a discussion with Thomas about allowing this upstream. I'll look into merging it once I can upstream the latency annotation information.

@njriasan njriasan changed the base branch from ws-main to main November 16, 2025 03:17
@njriasan njriasan changed the base branch from main to ws-main November 16, 2025 03:21