-
-
Notifications
You must be signed in to change notification settings - Fork 3.8k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Bugfix] Fix InternVL2 vision embeddings process with pipeline parallel
#8299
opened Sep 9, 2024 by
Isotr0py
Loading…
[Bugfix] Fix weight loading issue by rename variable.
ready
ONLY add when PR is ready to merge/full CI is needed
#8293
opened Sep 9, 2024 by
wenxcs
Loading…
[Bugfix] Correct adapter usage for cohere and jamba
ready
ONLY add when PR is ready to merge/full CI is needed
#8292
opened Sep 9, 2024 by
vladislavkruglikov
Loading…
[Bugfix] Mapping physical device indices for e2e test utils
#8290
opened Sep 9, 2024 by
ShangmingCai
Loading…
[not-for-review] test PR multi py ver
ready
ONLY add when PR is ready to merge/full CI is needed
#8253
opened Sep 6, 2024 by
khluu
Loading…
[Bugfix][Frontend] Update all fastapi requests based on OpenAPIBase with annotations
#8251
opened Sep 6, 2024 by
drikster80
•
Draft
[BugFix] Propagate 'trust_remote_code' setting in internvl and minicpmv
#8250
opened Sep 6, 2024 by
zifeitong
Loading…
[Core] support LoRA and prompt adapter in content-based hashing for Block Manager v2 prefix caching
#8240
opened Sep 6, 2024 by
llsj14
Loading…
[BugFix] Fix metrics error for --num-scheduler-steps > 1
ready
ONLY add when PR is ready to merge/full CI is needed
#8234
opened Sep 6, 2024 by
yuleil
Loading…
[Spec Decode] Move ops.advance_step to flash attn advance_step
ready
ONLY add when PR is ready to merge/full CI is needed
#8224
opened Sep 6, 2024 by
kevin314
Loading…
[Misc] Fused MoE Marlin support for GPTQ
ready
ONLY add when PR is ready to merge/full CI is needed
#8217
opened Sep 5, 2024 by
dsikka
Loading…
[Misc] Upgrade vllm-flash-attn to v2.6.2
ready
ONLY add when PR is ready to merge/full CI is needed
#8211
opened Sep 5, 2024 by
WoosukKwon
Loading…
[Model] Adding Granite MoE.
ready
ONLY add when PR is ready to merge/full CI is needed
#8206
opened Sep 5, 2024 by
shawntan
Loading…
[OpenVINO] Enable GPU support for OpenVINO vLLM backend
Intel GPU
#8192
opened Sep 5, 2024 by
sshlyapn
Loading…
[Benchmark] Add block_size option to benchmark_throughput.py
#8175
opened Sep 5, 2024 by
liangfu
Loading…
Previous Next
ProTip!
Filter pull requests by the default branch with base:main.