Pull requests: tile-ai/tilelang
Add assert that block_K is a multiple of micro_size_k in CUDA MMA GEMM backends to prevent silent miscompilation.
#2390
opened Jun 12, 2026 by
Federicorao
[WIP] [DO NOT REVIEW] [Backend] Support TMA lowering for arbitrary (swizzled) SMEM layout
#2380
opened Jun 11, 2026 by
Yongqi-Zhuo
Collaborator
Draft
[Autotune] Stabilize autotune benchmarking across devices and fix the inconsistency in the autotuning process.
#2370
opened Jun 11, 2026 by
Wazrrr
Contributor
[BugFix][CuTeDSL] Fix TileKernels scan, optional-shape, and e5m6 paths
#2369
opened Jun 10, 2026 by
JayceSu98
Contributor
[BugFix][Transform] Demote illegal-width residual cp.async to a synchronous copy
#2366
opened Jun 10, 2026 by
Hughshine
Contributor
[CUDA] Add SM120 NVF4 block-scale MMA support
#2364
opened Jun 9, 2026 by
qqq-tao
Contributor
[Feature] Flash Attention 1SM/2SM kernels with features supported
#2360
opened Jun 9, 2026 by
chengyupku
Contributor
3 tasks done
[Fix][Pipeline] Prevent double expansion of shared buffers across sibling pipelines
#2342
opened Jun 5, 2026 by
harelhuang
Contributor
3 tasks done
[AMD][ROCm] Fix CI failures on gfx950, gfx1100, gfx1151, and gfx1201
#2326
opened Jun 3, 2026 by
zhangnju
Collaborator
[Stacked][Feature] Support NVFP4 Gemm on Blackwell arch (SM100,110,120)
#2324
opened Jun 3, 2026 by
Hale423
Contributor
[Backend] Add unified backend resolution policy
#2318
opened Jun 2, 2026 by
SiriusNEO
Collaborator
[Enhancement] Add T.tma_copy barrier_rank argument and fold use_2cta into LowerTileOp
#2316
opened Jun 1, 2026 by
Rachmanino
Collaborator
[Feature] Support unaligned barrier sync
#2295
opened May 28, 2026 by
Rachmanino
Collaborator
Add SM120 dense block-scaled MMA and essential support for SageAttention3
#2253
opened May 23, 2026 by
sepcnt
Contributor
[Feature] Support Blackwell FP4(float4_e2m1fn) GEMM for SM100 & SM120
#2182
opened May 11, 2026 by
Hale423
Contributor
[Feature][Blackwell] Add SM120 T.float4_e2m1fn FP4 GEMM support.
#2171
opened May 8, 2026 by
TerminusAkivili
Contributor
[Feature] Support mutable TMA descriptor and canonicalize usage in examples
#2113
opened Apr 28, 2026 by
Rachmanino
Collaborator
Draft