-
Notifications
You must be signed in to change notification settings - Fork 2.5k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[None][fix] fix tinygemm barrier bug
#15338
opened Jun 13, 2026 by
yweng0828
Collaborator
Loading…
1 task done
[None][test] Waive 23 failed cases for main in QA CI
#15337
opened Jun 13, 2026 by
tensorrt-cicd
Collaborator
•
Draft
[None][test] CI-only: probe gb200 deepseek-v32 perf-sanity at 5f106dfa (DO NOT MERGE)
#15336
opened Jun 13, 2026 by
tensorrt-cicd
Collaborator
•
Draft
[TRTLLM-12807][test] Guard thop attention kwarg aliases
#15335
opened Jun 13, 2026 by
yuxianq
Collaborator
Loading…
1 task done
[None][test] Waive 5 failed cases for main in QA CI
#15334
opened Jun 13, 2026 by
tensorrt-cicd
Collaborator
•
Draft
[https://nvbugs/6281014][fix] fix the repeated cute.compile and simpilify the test
#15331
opened Jun 13, 2026 by
JadoTu
Collaborator
Loading…
1 task done
[#15327][feat] Add per-request priority support to OpenAI chat/completions
#15329
opened Jun 12, 2026 by
sopwg612
Loading…
1 task done
[https://nvbugs/6306936][test] Re-enable AutoDeploy disagg tests
#15325
opened Jun 12, 2026 by
govind-ramnarayan
Collaborator
Loading…
1 task done
[TRTLLM-12721][perf] Remove ready-ID transfer gathers
#15324
opened Jun 12, 2026 by
chienchunhung
Collaborator
•
Draft
[None][test] Fix Mamba hybrid transceiver helper
#15323
opened Jun 12, 2026 by
chienchunhung
Collaborator
Loading…
[None][chore] Small cleanups to MultimodalModelMixin
#15322
opened Jun 12, 2026 by
2ez4bz
Collaborator
Loading…
1 task done
[https://nvbugs/6193854][fix] PR #14851 already removed the bad
is_sliding_window/mMaxSeqLenKv logic on…
#15321
opened Jun 12, 2026 by
tensorrt-cicd
Collaborator
Loading…
2 tasks done
[None][test] Waive 1 failed cases for main in QA CI
#15320
opened Jun 12, 2026 by
tensorrt-cicd
Collaborator
•
Draft
[None][test] Waive 3 failed cases for main in QA CI
#15319
opened Jun 12, 2026 by
tensorrt-cicd
Collaborator
Loading…
[None][fix] visual_gen/wan: emulate eager precision casts under torch.compile
#15318
opened Jun 12, 2026 by
karljang
Collaborator
Loading…
[None][test] Waive 1 failed cases for main in QA CI
#15315
opened Jun 12, 2026 by
tensorrt-cicd
Collaborator
Loading…
feat(visual_gen): NIXL P2P transport for ring-attention KV exchange
#15314
opened Jun 12, 2026 by
pkisfaludi-nv
Loading…
5 tasks
[None][feat] Add PyTorch reset_prefix_cache API
api-compatible
Accepted LLM API contract change that is backwards-compatible
[https://nvbugs/6270671][fix] Replace the hardcoded multiBlock=1 with a call to…
#15312
opened Jun 12, 2026 by
tensorrt-cicd
Collaborator
Loading…
2 tasks done
[None][fix] disagg: use ctx retry id for gen request
#15309
opened Jun 12, 2026 by
liji-nv
Collaborator
Loading…
1 task done
[None][feat] Kv transfer p2p path
#15308
opened Jun 12, 2026 by
chuangz0
Collaborator
Loading…
1 task done
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.