Pull requests: NVIDIA/TensorRT-LLM
Pull requests list
[None][feat] Add MiniMax-M3 PyTorch backend bring-up
#15292
opened Jun 12, 2026 by
WeiHaocheng
Collaborator
1 task
[None][test] Skip flaky TestDeepSeekV4Flash::test_auto_dtype disagg o…
deepseek-v4
#15291
opened Jun 12, 2026 by
mingyangHao
Collaborator
1 task done
[TRTLLM-13246][feat] Wave 2: stage Linear and Attention transforms
#15288
opened Jun 12, 2026 by
chienchunhung
Collaborator
Draft
[#15289][fix] AutoDeploy: Enable chunked prefill for Super V3 MTP
#15287
opened Jun 12, 2026 by
govind-ramnarayan
Collaborator
1 task
[None][feat] AutoDeploy: Enable DeepSeek MTP
#15286
opened Jun 12, 2026 by
govind-ramnarayan
Collaborator
Draft
1 task
feat(visual_gen): UBX Caliper all-to-all for Ulysses sequence parallelism
#15285
opened Jun 12, 2026 by
pkisfaludi-nv
5 tasks
[None][perf] offload chat template rendering into async
#15284
opened Jun 12, 2026 by
yechank-nvidia
Collaborator
Draft
[TRTLLM-12950][feat] MegaMoE CuteDSL: import latest kernel + tuning rework
#15283
opened Jun 12, 2026 by
xxi-nv
Collaborator
[None][chore] reduce requests for kubernetes SLURM proxy
#15281
opened Jun 11, 2026 by
tburt-nv
Collaborator
1 task done
[None][fix] Fix AutoDeploy shim test expecting soft fallback for speculative+flashinfer
#15280
opened Jun 11, 2026 by
achartier
Collaborator
1 task done
[TRTLLM-13024][perf] Make chat template application non-blocking
#15278
opened Jun 11, 2026 by
2ez4bz
Collaborator
1 task done
[None][feat] Do not review Pipeline cache test
#15276
opened Jun 11, 2026 by
nvchenghaoz
Collaborator
1 task done
[None][infra] PLC nightly pipeline update
#15274
opened Jun 11, 2026 by
yuanjingx87
Collaborator
1 task
[None][fix] pool-qualify KV cache transfer pending keys
#15272
opened Jun 11, 2026 by
chienchunhung
Collaborator
[None][chore] Integration tests for MoE lora & bugfixes
#15271
opened Jun 11, 2026 by
brb-nv
Collaborator
1 task done
[None][infra] Waive 1 failed cases for main in pre-merge 42752
#15270
opened Jun 11, 2026 by
ZhanruiSunCh
Collaborator
[https://nvbugs/6293536][fix] At the entry of V1 KVCM prepare_resources, when blocks_in_secondary_pool > 0…
#15267
opened Jun 11, 2026 by
tensorrt-cicd
Collaborator
2 tasks done
[None][fix] DSv4 MLA overlap: record_stream cross-stream tensors
deepseek-v4
#15265
opened Jun 11, 2026 by
mingyangHao
Collaborator
1 task done
[https://nvbugs/6290967][fix] Update the two Cosmos3 quant assertions to read…
#15264
opened Jun 11, 2026 by
tensorrt-cicd
Collaborator
2 tasks done
[TRTLLM-13378][feat] Drop legacy --extra_visual_gen_options CLI alias
#15262
opened Jun 11, 2026 by
zhenhuaw-me
Member
1 task done
[None][test] CI bisect nvbugs/6280721: probe baseline-2 a8c4007 (Fix AutoDeploy accuracy tests #13925)
#15261
opened Jun 11, 2026 by
tensorrt-cicd
Collaborator
Draft
[None][fix] AutoDeploy: set enable_spec_decode on ADEngine for disagg
#15260
opened Jun 11, 2026 by
Shixiaowei02
Collaborator
1 task done
[None][ci] tighten VisualGen CBTS routing
VisualGen
#15259
opened Jun 11, 2026 by
zhenhuaw-me
Member
[None][perf] cutedsl grouped/swiglu GEMM: Fix acc pipeline release arrive threads
#15258
opened Jun 11, 2026 by
liyuhannnnn
Collaborator
1 task done
[None][perf] DSV4 o_proj: fuse fp8/UE8M0 quantize into o_a CuTe-DSL e…
deepseek-v4
#15257
opened Jun 11, 2026 by
mingyangHao
Collaborator
Draft
1 task done