
Pull requests: vllm-project/vllm


Pull requests list

[Qwen3-Omni] fixed _get_feat_extract_output_lengths function
Labels: qwen
#31007 opened Dec 19, 2025 by wangxiongts
5 tasks
[Bugfix] Fix Ray GPU availability warning message
Labels: v1
#31006 opened Dec 19, 2025 by jarieshan
5 tasks
[Mics] add pcp basic support to MoE model
#31003 opened Dec 19, 2025 by pisceskkk
feat(kernel): patch fused_gdn_gating
Labels: qwen
#31002 opened Dec 19, 2025 by OsirisDuan
5 tasks
Add Molmo2 multimodal model support
Labels: documentation, multi-modality, new-model
#30997 opened Dec 19, 2025 by sangho-vision
4 of 5 tasks
[WIP]improve cpu Benchmark Suite tests for 0.12.0
Labels: ci/build, cpu, performance
#30994 opened Dec 19, 2025 by louie-tsai
5 tasks
Bump Flashinfer to v0.6.0rc1
Labels: ci/build, nvidia
#30993 opened Dec 18, 2025 by elvischenv
5 tasks
[ROCm][CI/Build] Update ROCm dockerfiles
Labels: ci/build, rocm
#30991 opened Dec 18, 2025 by gshtras
[MoE Refactor][3/N] Deprecate cutlass block quant fp8 (b200)
Labels: nvidia, ready
#30990 opened Dec 18, 2025 by robertgshaw2-redhat
5 tasks
Update Pytorch version update docs
Labels: documentation
#30982 opened Dec 18, 2025 by atalman
blackwell frontend v1
#30981 opened Dec 18, 2025 by dtunikov
Add positional embedding and kv_cache fusion for llama and gpt-oss
Labels: gpt-oss, llama, v1
#30978 opened Dec 18, 2025 by dllehr-amd Draft
5 tasks
Docs: add OpenAI SDK example for Qwen2.5-VL classification
Labels: documentation, qwen
#30977 opened Dec 18, 2025 by Dhruv-80
Use aiter triton fused_add_rmsnorm_pad for gpt-oss
Labels: gpt-oss
#30976 opened Dec 18, 2025 by Rohan138 Draft
5 tasks
[Misc] Disable default --ready-check-timeout-sec extra call in vllm bench
Labels: performance
#30975 opened Dec 18, 2025 by NickLucche
[Bugfix] Fix incorrect tiles creation for mm prefix triton attention
Labels: ready
#30974 opened Dec 18, 2025 by Isotr0py
5 tasks