Pull requests: vllm-project/vllm
Pull requests list
[Hardware][HPU] correct inference_mode method signature
#18551
opened May 22, 2025 by
andyxning
[Feature] Support multiple api keys in server
frontend
#18548
opened May 22, 2025 by
Yanpas
[Bugfix] handle zero-token batches by creating empty attention metadata for KV cache load
v1
#18545
opened May 22, 2025 by
hammersam
[Hardware][CPU] Update intel_extension_for_pytorch 2.7.0 and move to requirements/cpu.txt
ci/build
#18542
opened May 22, 2025 by
yankay
[Model][Speculative Decoding] Integrate PARD into vLLM
speculative-decoding
#18541
opened May 22, 2025 by
zihaoanllm
Use prebuilt FlashInfer x86_64 PyTorch 2.7 CUDA 12.8 wheel for CI
ci/build
#18537
opened May 22, 2025 by
huydhn
[Neuron] Remove bypass on EAGLEConfig and add a test
#18514
opened May 22, 2025 by
elaineyz
[Misc][Benchmark] Add support for CustomDataset
#18511
opened May 21, 2025 by
ekagra-ranjan
Fix: Proper RGBA -> RGB conversion for PIL images.
multi-modality
Related to multi-modality (#4194)
#18508
opened May 21, 2025 by
ChenheliHua
[Doc] Update quickstart and install for cu128 using --torch-backend=auto
documentation
Improvements or additions to documentation
#18505
opened May 21, 2025 by
mgoin
[Bugfix] Fix spec decode on non-cuda platforms
speculative-decoding
#18501
opened May 21, 2025 by
rand-fly
[VLM] Initialize video input support for InternVL models
documentation
Improvements or additions to documentation
frontend
multi-modality
Related to multi-modality (#4194)
#18499
opened May 21, 2025 by
Isotr0py
Add Apple Silicon bf16 Support
ci/build
#18497
opened May 21, 2025 by
rahuja23
3 tasks done
Enable hybrid attention models for Transformers backend
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
#18494
opened May 21, 2025 by
hmellor
[Spec Decode] Make EAGLE3 draft token ID mapping optional
v1
#18488
opened May 21, 2025 by
benchislett
[Misc] refactor: simplify input validation and num_requests handling in _convert_v1_inputs
frontend
ready
ONLY add when PR is ready to merge/full CI is needed
#18482
opened May 21, 2025 by
googs1025