We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Add rope_scaling to Qwen (vllm-project#1210)
Bump up the version to v0.1.7 (vllm-project#1013)
Bump up the version to v0.1.6 (vllm-project#989)
Bump up the version to v0.1.5 (vllm-project#944)
Bump up the version to v0.1.4 (vllm-project#846)
Bump up version to 0.1.3 (vllm-project#657)
Bump up the version (vllm-project#300)
Bump up version to 0.1.1 (vllm-project#204)
Use slow tokenizer for open llama models (vllm-project#168)
Change plotting script