Skip to content

Tags: Pradyun92/vllm

Tags

v0.10.2

Toggle v0.10.2's commit message
[CI Failure] Fix test_flashinfer_cutlass_mxfp4_mxfp8_fused_moe (vllm-…

…project#24750)

Signed-off-by: mgoin <mgoin64@gmail.com>

v0.10.2rc3

Toggle v0.10.2rc3's commit message
[Compilation Bug] Fix Inductor Graph Output with Shape Issue (vllm-pr…

…oject#24772)

Signed-off-by: yewentao256 <zhyanwentao@126.com>

ci/build/22474

Toggle ci/build/22474's commit message
Merge branch 'main' into wye-refactor-quant-folder

v0.10.2rc2

Toggle v0.10.2rc2's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
[Bugfix] fixes the causal_conv1d_update kernel update non-speculative…

… decoding cases (vllm-project#24680)

Signed-off-by: Tao He <linzhu.ht@alibaba-inc.com>
Co-authored-by: Cyrus Leung <tlleungac@connect.ust.hk>

v0.10.2rc1

Toggle v0.10.2rc1's commit message
this is only used to fix nightly wheel version, not a real release ca…

…ndidate

v0.10.1.1

Toggle v0.10.1.1's commit message
Do not use eval() to convert unknown types (vllm-project#23266)

Signed-off-by: Russell Bryant <rbryant@redhat.com>
Signed-off-by: simon-mo <simon.mo@hey.com>

v0.10.1

Toggle v0.10.1's commit message
Use Blackwell FlashInfer MXFP4 MoE by default if available (vllm-proj…

…ect#23008)

Signed-off-by: mgoin <mgoin64@gmail.com>

v0.10.1rc1

Toggle v0.10.1rc1's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
fix: gptq marlin weight loading failure (vllm-project#23066)

v0.10.0

Toggle v0.10.0's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Add think chunk (vllm-project#21333)

Signed-off-by: Julien Denize <julien.denize@mistral.ai>

v0.10.0rc2

Toggle v0.10.0rc2's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Add think chunk (vllm-project#21333)

Signed-off-by: Julien Denize <julien.denize@mistral.ai>