Pull requests: ROCm/aiter
Fix sgl deepseek error with cudagraph plus torch.compile
#1145
opened Oct 9, 2025 by
ZhangLirong-amd
1 task
[MI35X] Fix FA bwd result mismatch when sq==1
#1144
opened Oct 9, 2025 by
slippedJim
1 task
Fix the error in the fwd v3 API when group mode does not yet support SWA
#1139
opened Oct 9, 2025 by
minmengdie
1 task
fmha v3 API only generate for supported targeting GPU arch
#1134
opened Oct 7, 2025 by
HollowMan6
1 task done
[CK_TILE] Temporarily disable k length=1 test cases in sequence padding
#1129
opened Oct 3, 2025 by
Jeff-Huang
[Triton] e2e fused MoE for small N and fp8 blockscale MoE benching
#1126
opened Oct 2, 2025 by
juuso-oskari
lean_attention: add GQA support across kernel and wrapper; add tests
#1123
opened Oct 1, 2025 by
kesavanramakrishnan
[Triton] FP4 GEMM weight shuffle support and tuning
#1120
opened Sep 30, 2025 by
k50112113
[config] tune gemm and moe for qwen3 480b ptpc model on MI308
#1084
opened Sep 25, 2025 by
gbyu-amd