Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Bugfix] Fix gridDim.y overflow for large row counts bug Something isn't working
#45255 opened Jun 11, 2026 by JasonLi314 Loading…
3 tasks done
[MM][CG] Support ViT full CUDA graph for Ernie-4.5-VL image inference documentation Improvements or additions to documentation multi-modality Related to multi-modality (#4194) nvidia
#45254 opened Jun 11, 2026 by qyYue1389 Contributor Loading…
docs: add fix disclosure policy to SECURITY.md documentation Improvements or additions to documentation
#45253 opened Jun 11, 2026 by jperezdealgaba Contributor Loading…
[Security] Fix DoS via prompt_embeds on M-RoPE models v1
#45252 opened Jun 11, 2026 by jperezdealgaba Contributor Loading…
[Bugfix] Restrict FlashInfer cuDNN FP8 ViT attention gate to Blackwell (SM 100) bug Something isn't working nvidia
#45251 opened Jun 11, 2026 by wentian-byte Loading…
1 of 3 tasks
[CI] Enable sccache for Rust build under CUDA/ROCm ci/build intel-gpu Related to Intel GPU nvidia ready ONLY add when PR is ready to merge/full CI is needed rocm Related to AMD ROCm rust
#45246 opened Jun 11, 2026 by BugenZhao Member Loading…
4 tasks
[Bugfix] Pre-compile _zero_kv_blocks_kernel and _compute_slot_mapping_kernel during warmup bug Something isn't working
#45245 opened Jun 11, 2026 by z-priyanshu Loading…
2 of 3 tasks
minicpmv4_6: fix ImageSize (W,H) order for placeholder token calculation
#45244 opened Jun 11, 2026 by tc-mb Contributor Loading…
[RISC-V] Enable BF16 on VLEN=256 hardware ci/build cpu Related to CPU backends
#45243 opened Jun 11, 2026 by velonica0 Contributor Loading…
[XPU][DeepSeek-V4] Fix MTP: sync with upstream fixes #44821 and #43746 deepseek Related to DeepSeek models intel-gpu Related to Intel GPU
#45240 opened Jun 11, 2026 by majian4work Contributor Loading…
fix(ep): use floor/ceil for n_local_physical_experts bookkeeping in DeepSeek-V2 and Qwen3-MoE ci/build cpu Related to CPU backends deepseek Related to DeepSeek models documentation Improvements or additions to documentation frontend needs-rebase nvidia qwen Related to Qwen models rocm Related to AMD ROCm v1
#45239 opened Jun 11, 2026 by max-amos Draft
[ROCm] Enable ROCm Attention Sinks and Connector-Friendly KV Layouts documentation Improvements or additions to documentation kv-connector rocm Related to AMD ROCm v1
#45234 opened Jun 11, 2026 by AndreasKaratzas Member Draft
Bump the minor-update group across 1 directory with 150 updates ci/build dependencies Pull requests that update a dependency file nvidia rocm Related to AMD ROCm
#45233 opened Jun 11, 2026 by dependabot Bot Loading…
[Bugfix][KV-transfer] MoRIIO: READ-mode stability fixes (completion IDs, DP routing, drain, keepalive) bug Something isn't working documentation Improvements or additions to documentation kv-connector v1
#45230 opened Jun 11, 2026 by chaeminlim-mb Draft
3 of 4 tasks
ProTip! no:milestone will show everything without a milestone.