Pull requests: vllm-project/vllm

Pull requests list

[Model] support minicpm3
#8297 opened Sep 9, 2024 by SUDA-HLT-ywfang
[Misc] Added num_cumulative_preemption metrics
#8294 opened Sep 9, 2024 by zeroorhero
[Bugfix] Fix weight loading issue by rename variable. (label: ready; label description: "ONLY add when PR is ready to merge/full CI is needed")
#8293 opened Sep 9, 2024 by wenxcs
[Bugfix] Correct adapter usage for cohere and jamba (label: ready)
#8292 opened Sep 9, 2024 by vladislavkruglikov
[Bugfix] Fix LongRoPE bug
#8254 opened Sep 7, 2024 by garg-amit
[not-for-review] test PR multi py ver (label: ready)
#8253 opened Sep 6, 2024 by khluu
[Model] Support multiple images for qwen-vl
#8247 opened Sep 6, 2024 by alex-jw-brooks
[Kernel] Build flash-attn from source
#8245 opened Sep 6, 2024 by ProExpertProg
[BugFix] Fix metrics error for --num-scheduler-steps > 1 (label: ready)
#8234 opened Sep 6, 2024 by yuleil
[Spec Decode] Move ops.advance_step to flash attn advance_step (label: ready)
#8224 opened Sep 6, 2024 by kevin314
[Misc] Fused MoE Marlin support for GPTQ (label: ready)
#8217 opened Sep 5, 2024 by dsikka
[Misc] Upgrade vllm-flash-attn to v2.6.2 (label: ready)
#8211 opened Sep 5, 2024 by WoosukKwon
Fix shutdown problem
#8209 opened Sep 5, 2024 by Bye-legumes
[Model] Adding Granite MoE. (label: ready)
#8206 opened Sep 5, 2024 by shawntan
Reshape cache to be XQA kernel compatible
#8200 opened Sep 5, 2024 by wenscarl