Tags: vllm-project/vllm
v0.9.0.1
[BugFix] FA2 MLA Accuracy Issue (#18807)
Signed-off-by: LucasWilkinson <lwilkinson@neuralmagic.com>
v0.9.0
[Bugfix] Mistral tool calling when content is list (#18729)
Signed-off-by: mgoin <mgoin64@gmail.com>
v0.8.5.post1
[BugFix][Attention] Fix sliding window attention in V1 giving incorrect results (#17574)
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
v0.8.5
[Model] Add tuned triton fused_moe configs for Qwen3Moe (#17328)
Signed-off-by: mgoin <mgoin64@gmail.com>
v0.8.4
[Core][V0] Enable regex support with xgrammar (#13228)
Signed-off-by: Russell Bryant <rbryant@redhat.com>
v0.8.3
Revert "[V1] DP scale-out (1/N): Use zmq ROUTER/DEALER sockets for input queue (#15906)"
This reverts commit 651cf0f.
v0.8.3rc1
[V1][Spec Decode] Update N-gram Proposer Interface (#15750)
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
v0.8.2
[V1][Spec Decode] Update target_logits in place for rejection sampling (#15427)
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
v0.8.1
[V1] Minor V1 async engine test refactor (#15075)
Signed-off-by: andoorve <murali.andoorveedu@mail.utoronto.ca>
Co-authored-by: andoorve <murali.andoorveedu@mail.utoronto.ca>
v0.8.0
[Bugfix] Fix LoRA extra vocab size (#15047)
Signed-off-by: Jee Jee Li <pandaleefree@gmail.com>