chore(deps-dev): update vllm requirement from >=0.22.0 to >=0.22.1 by dependabot[bot] · Pull Request #440 · OpenBMB/UltraRAG

dependabot · 2026-06-10T11:23:49Z

Updates the requirements on vllm to permit the latest version.
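
As a rough illustration of what the new lower bound means, the sketch below checks whether a version string satisfies a minimum such as 0.22.1. The helper names `parse_release` and `satisfies_minimum` are hypothetical, and the comparison looks only at the numeric release segment; a real resolver (for example the `packaging` library's `SpecifierSet`) also handles pre-releases, epochs, and local versions.

```python
import re
from typing import Tuple


def parse_release(version: str) -> Tuple[int, ...]:
    """Parse the leading numeric release segment, e.g. '0.22.1' -> (0, 22, 1)."""
    match = re.match(r"\d+(?:\.\d+)*", version.strip())
    if match is None:
        raise ValueError(f"not a numeric version: {version!r}")
    return tuple(int(part) for part in match.group().split("."))


def satisfies_minimum(installed: str, minimum: str) -> bool:
    """Return True if `installed` >= `minimum`, comparing release segments only."""
    a, b = parse_release(installed), parse_release(minimum)
    width = max(len(a), len(b))
    # Pad with zeros so that '0.22' compares equal to '0.22.0'.
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b


if __name__ == "__main__":
    for candidate in ("0.22.0", "0.22.1", "0.23"):
        print(candidate, satisfies_minimum(candidate, "0.22.1"))
```

Under this simplified comparison, 0.22.0 no longer satisfies the requirement after this PR, while 0.22.1 and anything later do.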

Release notes

v0.22.1

Highlights

This release features 8 commits from 6 contributors (1 new)!

v0.22.1 is a patch release on top of v0.22.0 with targeted bug fixes plus a couple of additions: new model support for JetBrains' Mellum v2, zentorch-accelerated quantized linear inference on AMD Zen CPUs, and fixes for multi-node Ray data-parallel serving, DeepSeek-V4 initialization, and a few model-loading regressions.

Model Support

New model: JetBrains' Mellum v2, an open-weights Mixture-of-Experts code-generation model (#43992).

DeepSeek-V4: resolve a CUTLASS fmin compatibility issue that broke initialization (0decac0d).

Fix OlmoHybridForCausalLM failing to initialise after the checkpoint changed rope_parameters from None to {"rope_type": None} (#43846).

Fix HyperCLOVAX loading after the upstream HuggingFace repo removed its remote code (now native in transformers >= 5.9.0): register the hyperclovax model_type so vLLM uses its vendored config instead of the stale auto_map (#43860).
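
The OlmoHybridForCausalLM item above describes a config value changing shape from None to {"rope_type": None}. A minimal sketch of a check that tolerates both shapes might look like the following. The helper `has_rope` is hypothetical and is not taken from the vLLM fix in #43846; it only illustrates treating the two representations as equivalent.

```python
from typing import Any, Dict, Optional


def has_rope(rope_parameters: Optional[Dict[str, Any]]) -> bool:
    """Return True only when a concrete rope_type is configured.

    Both None and {"rope_type": None} are treated as "no rotary
    embedding configured" (an assumption made for this sketch).
    """
    if rope_parameters is None:
        return False
    return rope_parameters.get("rope_type") is not None


if __name__ == "__main__":
    print(has_rope(None))
    print(has_rope({"rope_type": None}))
    print(has_rope({"rope_type": "default"}))
```

A check written as `rope_parameters is None` alone would classify the old and new checkpoint formats differently, which is the kind of mismatch this sketch avoids.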

Hardware & Performance

AMD Zen CPUs: route W8A8 (int8 dynamic-symmetric) and W4A16 (GPTQ) linear inference through zentorch kernels, registered ahead of the generic oneDNN CPU kernels, with transparent fallback on non-Zen CPUs, GPUs, and XPU (#41813).
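
The "registered ahead of ... with transparent fallback" behaviour can be pictured as an ordered registry in which the first kernel that reports support wins. The sketch below is an illustration only: `KernelEntry`, `REGISTRY`, `select_kernel`, and the platform tags are hypothetical names, not vLLM's kernel-selection API.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass(frozen=True)
class KernelEntry:
    name: str
    # Predicate over a hypothetical platform tag such as "cpu-zen".
    is_supported: Callable[[str], bool]


# Order matters: earlier entries are preferred when they report support.
REGISTRY: List[KernelEntry] = [
    KernelEntry("zentorch", lambda platform: platform == "cpu-zen"),
    KernelEntry("onednn", lambda platform: platform.startswith("cpu-")),
]


def select_kernel(platform: str,
                  registry: Optional[List[KernelEntry]] = None) -> Optional[str]:
    """Return the name of the first kernel supporting `platform`, else None."""
    for entry in (REGISTRY if registry is None else registry):
        if entry.is_supported(platform):
            return entry.name
    return None  # caller falls through to whatever other backend applies


if __name__ == "__main__":
    for tag in ("cpu-zen", "cpu-generic", "gpu"):
        print(tag, select_kernel(tag))
```

With this ordering a Zen CPU picks the specialised kernel, a generic CPU falls back to the next entry, and a platform that no entry supports gets None so that selection can continue elsewhere.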

Large Scale Serving

Fix a deterministic hang in multi-node Ray data-parallel serving with num_api_servers > 1 by excluding the Ray DP backend from the deferred (kernel-assigned) port allocation introduced in #42585 (#43864).
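
For readers unfamiliar with kernel-assigned ports: binding a socket to port 0 asks the operating system to choose a free port, and the actual number is only known after the bind. The sketch below shows that general technique, assuming it is what "deferred (kernel-assigned) port allocation" refers to. The function name is hypothetical and the code is not taken from vLLM.

```python
import socket
from typing import Tuple


def reserve_kernel_assigned_port(host: str = "127.0.0.1") -> Tuple[socket.socket, int]:
    """Bind to port 0 and return the open socket plus the port the OS chose."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    sock.bind((host, 0))  # port 0 means "let the kernel pick a free port"
    port = sock.getsockname()[1]
    return sock, port


if __name__ == "__main__":
    sock, port = reserve_kernel_assigned_port()
    try:
        print(f"kernel assigned port {port}")
    finally:
        sock.close()
```

Because the port is unknown until the bind happens, components that must agree on addresses ahead of time cannot rely on it directly. Whether that is the exact mechanism behind the Ray DP hang is not stated in the release notes.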

Build & CI

Docker: stop installing flashinfer-jit-cache via --extra-index-url while it is quarantined on PyPI, fixing image builds (#44366).

Normalize NIXL KV-connector wheel installs so only the wheel matching the image's CUDA major is kept, fixing ImportError: libcudart.so.12 when importing nixl_ep on CUDA 13 images (#44266).
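
The NIXL item above amounts to keeping only the wheel whose CUDA tag matches the image. A minimal sketch of that filtering step follows. The function `wheels_to_keep` and the wheel file names are hypothetical, and the sketch assumes each wheel name carries a `cu<major>` tag; the actual logic in #44266 may differ.

```python
import re
from typing import List

# Assumes the tag encodes the CUDA major only, e.g. "cu12" or "cu13".
_CUDA_TAG = re.compile(r"cu(\d+)")


def wheels_to_keep(wheel_names: List[str], cuda_major: int) -> List[str]:
    """Keep wheels without a CUDA tag and those matching `cuda_major`."""
    kept = []
    for name in wheel_names:
        match = _CUDA_TAG.search(name)
        if match is None or int(match.group(1)) == cuda_major:
            kept.append(name)
    return kept


if __name__ == "__main__":
    # Hypothetical file names, for illustration only.
    candidates = [
        "nixl_cu12-0.0.0-py3-none-any.whl",
        "nixl_cu13-0.0.0-py3-none-any.whl",
    ]
    print(wheels_to_keep(candidates, cuda_major=13))
```

Leaving a CUDA 12 wheel installed on a CUDA 13 image is the situation in which an import could look for libcudart.so.12 and fail, which matches the ImportError quoted above.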

Contributors

@khluu, @vadiklyutiy, @aadwived, @shadeMe, @alec-flowers, @hmellor

New Contributors

@aadwived made their first contribution in vllm-project/vllm#41813

Commits

0decac0 fix: resolve CUTLASS fmin compatibility for DeepSeek-V4 init
fd56c57 Fix OlmoHybridForCausalLM not initialising (#43846)
7285178 [Bugfix] Fix HyperCLOVAX CI failure after upstream removed remote code (#43860)
27509c8 [Bugfix][CI] Normalize NIXL connector CUDA wheel installs (#44266)
b284862 [docker] Stop using extra-index-url for flashinfer-jit-cache (#44366)
932dfd5 [Feature] Add support for JetBrains' Mellum v2 code generation model (#43992)
682ffeb [CPU][Zen] Route W8A8 and W4A16 linear inference through zentorch on AMD Zen ...
1be7a57 [Bugfix] Exclude Ray DP from #42585's deferred port allocation (#43864)
See full diff in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR
@dependabot recreate will recreate this PR, overwriting any edits that have been made to it
@dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Updates the requirements on [vllm](https://github.com/vllm-project/vllm) to permit the latest version.
- [Release notes](https://github.com/vllm-project/vllm/releases)
- [Changelog](https://github.com/vllm-project/vllm/blob/main/RELEASE.md)
- [Commits](vllm-project/vllm@v0.22.0...v0.22.1)

---
updated-dependencies:
- dependency-name: vllm
  dependency-version: 0.22.1
  dependency-type: direct:development
...

Signed-off-by: dependabot[bot] <support@github.com>

dependabot Bot added the dependencies and python labels Jun 10, 2026

xhd0728 approved these changes Jun 12, 2026


xhd0728 merged commit f334fa7 into main Jun 12, 2026
3 checks passed

dependabot Bot deleted the dependabot/pip/vllm-gte-0.22.1 branch June 12, 2026 05:26

xhd0728 merged 1 commit into main from dependabot/pip/vllm-gte-0.22.1
